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Preface 


Until  the  past  decade  the  term  '‘system  as  used  in  the  many  disciplines  of 
science  and  engineering  conveyed  a variety  of  meaning.  Since  the  early  1960’s 
theorists  have  been  folding  these  varied  impressions  into  the  single,  formidable 
conceptual  framework  of  linear  system  theory.  The  discussions  of  this  volume 
are  intended  to  lend  to  this  unification.  Specifically,  the  intent  is  to  draw  on  the 
conceptual  ideas  that  are  common  to  linear  filters,  linear  time-invariant  systems, 
Markov  chains  and  quantum  theory  and  present,  in  a non-rigorous  manner,  their 
respective  similarities.  ^ 

Chapter  1 deals  wi^  the  fundamental  concepts  upon  which  the  definition 
of  a system  can  be  established.  Such  notions  as  ordered  pairs,  oriented  abstract 
objects,  etc.,  serve  as  the  basics  of  the  definition.  The  system  response  is  defined 
in  terms  of  an  input-output-state  relationship  which  must  satisfy  certain  con- 
sistency conditions. 

The  system  definition  of  Chapter  1 is  built  upon  in  Chapters  2,  3 and  4.  In 
Chapter  2 the  conditions  for  consistency  are  re-cast  in  the  form  of  demon- 
strating the  separation  property  inherent  in  the  input-output-state  relationship. 
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PREFACE 


In  Chapter  3 the  state-space  representation  is  formalized  for  linear  differential 
systems,  and  Chapter  4 highlights  the  canonical  form  of  the  state-space  represen- 
tation, i.e. 


X = Ax  + Bu 
y = Cx  + Du 

In  Chapter  5 solutions  to  the  canonical  equations  are  presented  for  a 
variety  of  systems.  A natural  fall-out  of  the  state-space  formulation  are  the 
theoretical  concepts  of  controlability,  observability  and  stability,  which  are 
addressed  in  Chapter  6. 

In  the  last  two  chapters  the  reader  is  reminded  that  the  linear  theory 
developed  thus  far  has  limited  application  to  real  world  problems,  such  as  treat- 
ing large  complex  systems.  The  objective  of  Chapter  7 is  to  introduce  the  idea  of 
an  imprecisely  defined  (or  probabilistic)  system.  The  methods  of  extracting 
signals  from  noise,  both  of  which  are  treated  as  random  processes,  are  discussed 
as  a means  to  augment  the  linear  theory.  Finally,  Chapter  8,  which  deals  with 
quantized  systems,  ends  this  volume  by  highlighting  the  perturbation  method  for 
treating  state  transition  probabilities.  It  exemplifies  the  high  degree  of  difficulty 
encountered  in  analyzing  statistical  systems. 
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Fundamental  Concepts 
and  Definitions 


l.t  (NTROOUCTION 

Linear  system  theory  is  intended  as  a discipline  to  provide  a unified  conceptual 
framework  for  system  analysis.  In  establishing  this  framework  we  will  introduce 
such  notions  as  abstract  objects,  their  measurable  attributes,  and  the  mathemati- 
cal relations  between  attributes.  These  concepts  will  serve  as  the  primitives  of 
the  theory,  although  they  defy  precise  definition  in  unequivocal  terms.  Our  goal 
will  be  to  identify  a small  part  of  the  physical  world  we  intuitively  understand 
and  attempt  to  evolve  a quantitative  basis  for  analysis.  The  proof  of  this  quanti- 
tative understanding  will  be  manifest  in  our  ability  to  better  describe  the 
behavior  of  linear  differential  systems. 

We  think  of  a system  (in  vague  terms)  as  being  a collection  of  things  or 
objects  which  somehow  are  united  through  interaction  or  interdependence.  More 
precisely,  we  define*  a system  to  be  a partially  interconnected  set  of  abstract 

1.  After  Zadeh  and  Dcsoer,  Linear  System  Theory.  McGraw-Hill,  New  York,  1963.  pp.  1-65. 
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objects,  which  are  called  the  system  components.  These  components  may  be 
oriented  or  nonoriented:  they  may  be  finite  or  infinite  in  number;  and  each  of 
them  may  be  associated  with  a finite  or  infinite  numbei  of  terminal  variables.  It 
is  not  intended  or  expected  at  this  time  that  the  above  definition  be  fully 
comprehended  by  the  reader.  The  terms  used  to  define  the  system  arc  them- 
selves undefined.  Rather,  the  intent  here  is  to  set  the  stage  for  the  discussion  to 
follow.  The  remainder  of  the  chapter  will  be  devoted  to  clarifying  and  con- 
cretizing the  concept  of  a system,  and  some  of  its  ramifications. 

1.2  TIME  FUNCTIONS 

As  a foundation  for  the  sequel  our  initial  goal  is  to  establish  the  terms  of 
reference,  particularly  those  addressing  time  dependent  variables.  Consider  an 
object  (as  yet  undefined)  which  we  will  label  tf,  and  let  u be  a measurable 
attribute  of  tf.  u can  be  real  or  complex.  Let  it  be  understood  that  w is  a time 
function  where  time  ranges  from  to  °°.  Let  Tbe  a specific  subset  of  time  and 
t an  element  within  T.  (Typically,  T may  be  the  semi-infinite  interval  (/o.°°l> 
where  fi)  is  a particular  value  of  t\  or  T may  be  a finite  interval  ]>  The 
time  function  u defined  on  the  subset  T will  be  denotes  simply  as  u or  as 
tr[fo,tll>  where  [roi^ll  the  time  segment  T.  On  the  other  hand  m(?)  will 
denote  the  value  of  u at  time  t.  Thus,  u is  meant  as  t-hc  entire  set  of  pairs 
{(f,u(r))}  for  each  t in  T.  In  the  specific  time  segment  [/q.^i]  the  function 
M[f()>fil  is  the  totality  of  pairs  {(r,u(r))}  with  Figure  1.1  further 

illustrates  what  is  meant  by  u\tQ,t\ ] and  u{t)  for  real-valued  variables. 

In  general  the  discussions  to  follow  will  involve  the  set  or  class  of  time 
functions  {«}  more  so  than  a single  time  function  u.  In  the  case  u may  be 
regarded  as  a real  or  complex  variable  ranging  over  {«}.  The  range  of  u will  be 
denoted  as  /?[u] ; it  is  the  set  of  time  functions  to  which  u belongs.  Wlien  u 
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varies  over  , the  values  of  u for  fixed  t vary  over  a set  which  is  the  range  of 
the  variable  u(t).  The  range  of  u(r)  will  be  denoted  by  /?  [«(/))  • In  general 
/?[t<(r)]  is  independent  of  t.  Figure  1.2  further  illustrates /?lu(r)|  and/?|M]  for 
real-valued  variables.  The  distinction  between  /J[u(/)]  and  is  that  the 

former  includes  the  set  of  real  numbers  over  (-°°,°°),  whereas  the  latter  includes 
a set  of  time  functions. 

The  discussion  for  scalar  time  functions  can  readily  be  extended  to  include 
vector  (and  matrix)  functions.  In  bold  face  u(f)  denotes  an  n-vector.  The  range 
R”  of  u(r)  is  the  space  of  w-tuples  of  real  numbers  whose  values  are  assumed  at 
time  t.  (The  range  C"  will  be  used  to  denote  the  set  of  ordered  n-tuples  where 

( the  elements  of  u(t)  are  complex  numbers.)  We  have 

i 

’ u(f)  = (Mi(r),«2(f), ...,«„(?))  (11) 


where  m,-  (/  = 1 ,. . .,«)  are  real  or  complex  numbers  and  the  parameter  t identifies 
the  values  of  the  Uj  at  time  t. 

The  vector  function  u is  the  totality  of  pairs  {(r,u(/))}  for  each  t 'mT. 
For  the  specified  time  interval  [/o>^|]  the  vector  u[to  .ti  1 consists  of  the  pairs 
{(r,u(f))},  where  fg  < t < ti  • In  the  sequel  we  will  generally  denote  the  vector 
function  u[/o>^i  1 simply  asu,i.e., 


u = u[ro,/il 


'(ui(?0),«2(to).---“n(fo))'l 

( ) 

,(“(('lX"2('j) "n(fl))> 


(1.2) 


where  /q  and  t\  are  the  parameters  associated  with  the  time  segment  T. 
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Typically,  the  vector  u is  a vector-valued  time  function  which,  (a)  may 
contain  a finite  number  of  delta  functions  of  various  orders  over  a finite  interval, 
and  (b)  is  piecewise  continuous  and  has  piecewise  continuous  derivatives  of 
finite  orders  on  every  finite  interval  over  which  it  has  no  delta  functions.  Thus,  it 
will  be  assumed  that  ^[u(f)l  is  R"  and  /?(u]  is  the  space  of  all  vector-valued 
time  functions  which  are  finitely  differentiable  over  the  time  segments  T. 
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1.3  OBJECTS,  ATTRIBUTES  AND  TERMINAL  RELATIONS 

An  object  can  be  labeled  as  either  physical  !P  or  abstract  (f.  (Within  these  cate- 
gories we  can  further  classify  the  objects  as  either  oriented  or  non-oriented.)  By 
a physical  object  it  is  usually  meant  a physical  device  which  is  associated  with  a 

set  of  attributes  mi,U2.“3> - where  the  relations  between  these  attributes 

necessarily  characterize  the  object.  In  effect,  an  object  is  a set  of  variables  with  a 
defined  set  of  relations  between  them.  Specifically,  these  variables  are  called  the 
terminal  variables  of  object  &;  the  relations  between  them  are  the  terminal 
relations.  The  characterization  of  (i  by  its  terminal  relations  can  be  written 
symbolically  as 


(f'(t/|,U2,...,M„)  = 0 
M2, ...,«„)  = 0 


Cf"’(ui,«2. •••.“«)  = 0 

where  each  Qi,  j = l,2,...,nt,  represents  a relation  between  the  variables  i = 
1 ,2,...,n. 

If  an  object  is  characterized  by  terminal  relations  of  form  (1 .3)  and  no 
distinction  is  made  as  to  whether  the  variables  are  inputs  (causes)  or  outputs 
(effects)  then  (J  is  said  to  be  nonoriented.  On  the  other  hand,  if  the  variables  are 
clearly  categorized  as  inputs  and  outputs  (dependent  and  independent  variables), 
then  (f  is  said  to  be  oriented. 

In  keeping  with  the  idea  of  an  oriented  object  we  postulate  as  the  input  to 
the  Ar-tuple  u = («|  ,...Uk).  The  elements  u\,U2,  -.  are  time  functions  varying 
over  the  interval  (fo>fi)-  Similarly,  we  have  for  the  output  of  (f  the  »n-tuple 

The  range  of  u(r),  which  is  independent  of  time,  constitutes  the  input 
space  of  the  object.  The  range  of  the  segmented  time  function  u = u(/o,t|)  is 
the  input  segment  space.  It  is  important  to  note  that  f?[u]  depends  on  the 
parameters  Iq  and  fj  which  vary  over  T.  In  essence,  (i  is  associated  with  a family 
/?[u|  generated  by  the  parameters  fg  and  t| . The  same  can  be  said  for  the  range 
of  the  output  vector  /?  Ivl . 
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In  principle  the  relations  between  the  «,•  and  yj  can  be  established  by 
letting  u vary  over  its  allowable  range  /?(u]  and  observing  y.  The  input-output 
relations  of  (1.3)  can  be  expressed  as 

(1.4) 

(('"(«], >'2. •••3'm)  = 0 
or 

(u,v)  = 0 

V = 3(u)  (1-5) 

To  illustrate  equation  (1.5)  let  the  physical  object  f be  an  electrical 
capacitor.  A simple  experiment  is  devised  in  which  the  voltage  v across  the 
capacitor  is  being  measured.  Let  v be  designated  as  the  output  and  the  current  i 
through  the  capacitor  as  the  input.  We  have 


where  C is  a constant.  This  relation  can  readily  be  put  in  the  form  of  (1.5)  as 

g>(/>)  = 0 

Note  that  in  the  above  equations  v is  not  uniquely  defined  as  a function  of  i.  We 
can  determine  v to  within  an  added  constant  (constant  of  integration).  Also,  the 
roles  of  i and  v can  be  interchanged  with  no  change  in  formulation  or  results. 

1.4  ORIENTED  MODEL 

An  abstract  object  (J  that  admits  to  (1.5)  is  an  abstract  oriented  model  of  i*.  It 
can  be  represented  as  (((S’).  Similarly,  IP  is  a physical  realization  of  (f  and  can  be 
represented  as  ;P((f). 

An  abstract  oriented  model  can  be  formulated  to  represent  every  physical 
object.  However,  the  converse  is  not  always  true,  i.e.,  it  is  possible  to  generate 
abstract  oriented  and  nonoriented  models  for  which  physical  realizations  are  not 
possible.  (Clearly,  an  abstract  object  whose  terminal  variables,  say  vi  and  vj,  are 
represented  by  a lolation  of  the  form  v\  =jV2,  where  /=\/M,  cannot  be 
realized.)  On  the  other  hand  if  (f  is  physically  realizable,  it  can  be  realized  in  a 
variety  of  physical  forms,  - Thus  (f(^f’i),tf(5’2)"  a'l  represent  the 
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same  abstract  object  Q.  To  illustrate  this  point,  consider  the  straight  line  motion 
of  a mass  m accelerating  along  the  OX  axis.  Let  fP  represent  the  physical  mass. 
The  variable  attributes  of  !P  are  chosen  as  the  foice  F applied  along  the  axis,  the 
position  X,  velocity  x and  acceleration  x.  Our  experiment  is  to  vary  F and 
observe  x.  For  the  abstract  oriented  model  of  f,  i.e.,  Q(f),  we  designate  the 
input-output  as  u{=F)  andy(=x).  Thus,  the  input-output  relationship  is 

fl(?)  = fl(F,x)  = 0 

where 

F = mx 

The  corresponding  relation  for  !P(fl)is 

no.)  = = 0 

where 

u = my 

If  we  start  with  (J,  where  u = my,  then  !P  is  a physical  realization  of  3 with  x and 
F identified  as  u and  j,  respectively.  Similarly,  3 can  be  realized  by  a resistor 
where  the  current  through  the  resistor  is  identified  with  y and  the  voltage  across 
the  resistor  is  identified  with  u.  Other  similar  physical  examples  of  3 can  easily 
be  identified  pointing  out  the  fact  that  the  abstract  model  3 can  be  realized  in  a 
variety  of  physical  forms  9. 

To  solidify  the  analytic  meaning  of  an  oriented  abstract  object  consider 
the  observation  interval  Let  th®  vector  pair  (u,y), where  u = u[fo,ti] 

and  y = y [fo-fl]  > be  an  ordered  pair  of  time  functions  defined  on  [ ] • Let 

the  set  {(u,y)}  be  the  family  of  such  pairs  generated  by  varying  to  and  ty  over 
the  time  segment  T,  with  > to-  Accordingly,  an  oriented  abstract  object  Gi  is 
defined*  as  a family  of  sets  of  ordered  pairs  of  time  functions  {{(«,y)}}-  The 
generic  pair  is  (u,y).  u is  the  input  segment  and  y is  the  output  segment.  (u,y)  is 
the  input-output  pair  belonging  to  3 if  (u,y)  is  an  element  of  the  set  {(u,y)} 
for  some  [fo.fi]  in  T.  Thus,  an  oriented  abstract  object  can  be  identified  with 
the  totality  of  input-output  pairs  belonging  to  3. 

To  ensure  that  the  family  of  sets  {{(u,y)}}  is  defined  in  a consistent 
manner,  we  require  that  all  members  of  {{(u,y)}}  satisfy  the  consistency  con- 

1.  op.  cit. 

Note:  Notationally  { } denotes  a set  (or  family).  The  double  {{  }}  denotes  a family  of  sets. 
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dition.  The  condition  for  consistency  is  that  if  (u[  to.fi  ] .V [ ] ) is  an  input- 

output  pair  belonging  to  (3,  then  any  section  of  the  pair  within  [/o,/i]  also 
belongs  to  (J.  Specifically,  for  any  pair  (u[to,ti]  ,y  [to,T|  ] ) where  to  ^ tq  < T|  , 
To  < Ti  < ti,  we  require  that  u[to,ti]  =u[/o,/il  andy[To,Ti]  =y[to»^ll 
the  interval  [tq.T]  ] . 

The  sets  of  all  segments  of  u and  y over  such  that  the  pair  (u,y) 

belongs  to  object  3,  are  referred  to  as  the  input  segment  space  f?[uj  and  output 
segment  space  f?[y] , respectively.  It  is  implied  that  the  set  {(u,y)}  is  a subset  of 
the  product  space  /?[u]  X f?[y] . This  relationship  between  {(u,y)}  and  [u]  X 
/?[y]  is  illustrated  in  Figure  1.3.  Axes  aa'  and  represent  f?(u]  andf?[y], 
respectively.  The  area  ABCD  represents  the  product  space  created  by  R [uj  X 
/?[y] . The  object  area  represents  {(u,y)}. 

I As  a simple  example  of  an  ordered  input-output  pair  belonging  to  the 


abstract  object  3 consider  the  mass  object  of  Section  1.4.  The  dynamical 
behavior  of  the  mass  m under  the  influence  of  an  external  force  F is  described 
by  the  equation 
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u 


Designating  the  input  and  output , respectively , as  u = F and  y = x we  have  from 
equation  (1 .5) 


S(u,y)  = 0 


dy 


dt 


The  generic  input-output  pair  (u,y)  over  the  finite  time  interval  (?o-/|)  's  the 
pair  of  time  functions 


u(t) 


JtQ  ! 


where  /q  < ^ and  a is  a real  arbitrary  constant  (of  integration).  Such  pairs  as 

(1, /).(!, l+r),(l,2+r) (f, l + r^/2), (/,2  + /^/2),...,  represent  a family  of  sets 

of  input-output  pairs  for  the  mass  object.  (For  convenience  the  mass  m was 
normalized  to  unity.) 

As  another  example  of  an  oriented  abstract  object  consider  a simple  feed- 
back control  system  characterized  by  the  dynamic  relation 


dt 


= ay  + bu 


where  a and  b are  constants.  Again,  for  convenience  we  will  set  both  constants 
equal  to  unity.  Solving  for  y in  terms  of  u,  we  have  for  (u,y)  over  (to,ti ),  where 
Zq  < ^ and  a is  an  arbitrary  real  constant. 


I 


a + 


/"• 

'’tn 


Importantly,  we  note  that,  in  both  examples  cited,  to  each  input  segment 
there  corresponds  a family  of  output  segments  {/[/o>ti]  }■  The  arbi- 
trary constants  a and  a in  the  examples  play  the  respective  role  of  parameters 
generating  the  families. 

In  our  definition  of  an  object  we  have  required  that,  for  consistency,  every 
segment  of  an  input-output  pair  belonging  to  Q also  be  an  input-output  pair 
belonging  to  (I.  Qearly,  there  are  segments  of  input-output  pairs  which  are  not 
segments  of  other  input -output  pairs.  For  example,  consider  an  object  character- 
ized by 


y = u 


dt 


dt 


-f  = — t > 0 
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For  />0  input-output  pairs  such  as  (1,3),(4,5),(6, 1 where  u and  are 
constants,  all  satisfy  the  characterization  relationsliip  = li.  However,  they  are 
not  pairs  defined  on  the  interval  t > to,  (q  < 0,  where  (u(f ),«(?))  is  the  generic 
form.  It  is  for  this  reason  we  define  tf  as  a family  of  sets  of  input-output  pairs. 

In  the  applications  to  follow  it  will  be  sufficient  to  restrict  our  attention 
to  objects  characterized  by  a single  set  of  input-output  pairs  defined  over  the 
interval  T.  Such  objects  are  said  to  be  uniform.  Specifically,  an  object  (J  is  said 
to  be  a uniform  oriented  object  if  every  input-output  pair  (u[fo,fi  ] , y[tO'fl  I ) 
belonging  to  (f  is  a segment  of  an  input-output  pair  (uj-,Yr)  defined  over  T. 
Thus  a uniform  oriented  object  can  be  characterized  by  a single  set  of  pairs 
{(u7-,y/-)}.  If,  for  example,  T=  [0,°°)  then  (3  is  characterized  by  the  single  set 
of  ordered  pairs  {(u[0,°°),  y [0,°°))}  defined  over  T.  For  convenience,  and 
unless  otherwise  stated  in  the  sequel,  we  will  assume  that  3 is  a uniform  oriented 
object,  and  that  T= 


1.5  NOTION  OF  STATE 


The  abstract  object  has  been  defined  as  a relation,  i.e.,  a set  of  ordered  pairs 
(u,y)  rather  than  a function  where  for  each  u there  corresponds  a unique  y.  To  a 
given  u there  is  associated  a set  of  output  y’s.  u and  each  y in  the  set  comprise  an 
input-output  pair  belonging  to  3.  Thus,  we  have  departed  from  the  conventional 
definitions  which  identify  3 within  a function  (or  an  operator)  and  associate  a 
unique  output  for  each  input. 

The  set  of  distinct  y’s  associated  with  a given  input  u is  generated  by  the 
different  initial  conditions  under  which  u influences  3.  (These  different  initial 
conditions  are  equivalent  to  “initial  states.”)  One  way  of  associating  a unique  y 
with  each  u is  to  attach  a distinguishing  lable  to  each  pair  (u,y).  In  the  discus- 
sion to  follow  this  label,  which  we  denote  as  x(/o),  will  be  called  the  state  of  3 
at  time  to  - The  state  will  span  a (state)  space,  which  we  will  denote  as  in  such 
a way  that  y will  be  uniquely  determined  by  u and  x(fo)-  The  process  of 
attaching  the  state  label  is  called  parametrization  of  the  space  of  input-output 
I pairs.  Essentially,  by  establishing  the  state  of  3 at  time  to  we  separate  the  past 

from  the  future.  We  provide  that  information  about  the  past  that  is  relevant  to 
I determining  the  response  of  3 beginning  at  time  to- 


1.6  STATE-INPUT-OUTPUT  RELATIONS 


There  are  many  ways  in  which  a state  vector  x(/o)  can  be  associated  with  3. 
Therefore  the  definitions  given  below  will  be  a set  of  qualifying  conditions 
which,  if  met,  allow  one  to  label  the  state  of  3 at  time  to  as  x(/o). 


' V 


- • 
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For  discussion  purposes  consider  the  semi-closed  interval  (/o./l  and  the 
ordered  pair  (u(ro,t] , v(/o.t)).  which  is  a segment  of  (ut-.v/')  defined  over  T- 
Let  (f  be  characterized  over  ((qJ]  by  input-output  relation  (1.5),  i.e.,  by 

(}(u,y)  = 0 
y = (i(u) 

(This  is  equivalent  to  saying  C(  is  characterized  by  the  family  of  sets  of  ordered 
pairs  {{(u,y)}}.  Let  x be  a (vector)  variable  ranging  over  space  It  is  claimed' 
that  if  the  spaces  of  input-output  pairs  admit  to  a relation  of  the  form 

y(/0,^]  = F(x;u(ro.^l)  (1-6) 

which  satisfies  the  four  mutual-  and  self-consistency  conditions  set  forth  below, 
then  (a)  equation  (1.6)  qualifies  as  an  input-output-state  relation  for  (i,  (b)  space 
'F  qualifies  as  the  state-space  for  (f,  where  elements  of  'I'  are  the  states,  and  (c) 
the  variable  xq  = x(fo)  qualifies  as  the  state  of  (f  at  time  tg-  Thus,  if  the  cor- 
sistency  conditions  as  defined  below  are  satisfied  it  can  be  said  that  d is  com- 
pletely characterized  by  (1.6),  where  y(/o>fI  is  the  response  segment  of  (S  to 
input  segment  u(/o-f]  starting  in  state  xq  and  (u(fo.^].y(^0.^1)  the  input- 
output  pair  associated  with  xq. 

The  conditions  of  mutual-  and  self-consistency  which  qualify  (1.6)  as  the 
input-output-state  relation  for  (3  are  defined^  below. 


1.7  CONDITION  LMUTUAL-CONSISTENCY 

Every  input-output  pair  for  satisfies  relation  (1.6),  and  conversely.  To  elabo- 
rate, if  (u(ro,?l,  y(/o>t])>  or  simply  (u,y),  satisfies  (1.5)  then  the  pair  also 
satisfies  (1.6)  since  there  exists  an  x (say  xo)  in  'I'  such  that 

y(/0'^]  = F(xo;u(?o.r])  (1-7) 

Conversely,  any  pair  (u,y)  satisfying  (1 .6)  for  some  x in  'I'  over  (fo>l]  's  also  an 
input-output  pair  for  (J.  This  condition  must  hold  for  all  (fo>^]  *n  Tand  for  all 
u in  f?[ul.  The  purpose  of  the  mutual-consistency  condition  is  to  ensure  that 
equations  (1 .5)  and  (1.6)  both  represent  the  same  object. 

We  demonstrate  the  mutual-consistency  condition  by  again  considering  the 
mass  object  of  Section  1 .4.  (J  is  characterized  by 

1.  op.  C/t. 

2.  op.  cit. 


FUNDAMENTAL  CONCEPTS  AND  DEFINITIONS 


11 


— = U ("1=1)  (1-8) 

dt 

The  first  part  of  the  mutual-consistency  condition  is  fulfilled  in  the  sense  that 
(1 .8)  can  be  written  as 


where 


fl(u,y)  = 0 


(u,y)  = + J 


r 


4 


thereby  satisfying  (1 .5).  Further,  the  output  y satisfies  (1 .6)  since  we  can  write, 
with  proper  choice  of  x, 


AO  = xo  + 


u(i)di 


= fXxo’AO) 


Conversely,  consider  the  input-output  state  relation  of  the  form 


y(r)  = 0+1  u(?)d| 

•*10 

= F(a;u(0)  (1-9) 

With  proper  choice  of  x (=a)  the  pair  (u(f)^y(r))  characterizing  Cf  satisfies  (1.6). 
By  direct  substitution  of  (1.9)  into  (1.8)  it  follows  that  every  pair  (u,y)  satis- 
fying (1 .9)  also  satisfies  (1 .8).  Thus,  every  input-output  pair  for  (3  characterized 
by  (1 .8)  satisfies  (1 .6),  and  vice  versa. 


1.8  CONDITION  II,  FIRST  SELF-CONSISTENCY 

The  response  y(r)  at  any  time  r > fo  uniquely  determined  by  x and  u(ro.t] . 
This  must  be  true  for  all  to-  To  qualify  as  a state  space  of  fl,  the  space  W must 
have  the  property  that,  given  any  point  x in  W and  any  Input  u(fo,t]  (defined 
over  the  input  segment  space),  the  output  at  time  t is  uniquely  determined  by  x 


I 


■it  ' ♦ • 
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and  u.  The  point  x at  time  to  will  be  called  the  state  of  (f  at  to-  The  output  is 
independent  of  u or  y prior  to  time  to  - This  is  a key  property  of  the  state  space 
concept. 

An  input-output-state  relation  of  form  (1.9)  clearly  satisfies  the  first  self- 
consistency  condition-regardless  of  the  range  of  a.  (For  all  /q  *he  output  y at 
time  t is  uniquely  determined  by  a and  u.y  is  independent  of  m or>’  prior  to  /q-) 
However,  by  changing  the  upper  limit  of  the  integral  (1 .9)  from  t to  f + 1,  i.e.,  if 
we  have 


y(t)  = a + 


we  no  longer  satisfy  the  first  self-consistency  condition.  The  output  at  time  t 
cannot  be  determined  without  knowledge  of  u(|)  between  t and  f + 1 . 


1.9  CONDITION  III,  SECOND  SELF-CONSISTENCY 

If  the  input-output  pair  (u(fo>^ll.  v[^0>fl])  satisfies  (1.6),  then  (u[r,r|j, 
y 1 /,fl  ])  also  satisfies  (1 .7).  u(/,^]]  andyff,/]]  are  sections  of  u[?o. ft  ] and 
respectively,  and  to<t<ti.  This  must  hold  for  all  x in  all 
u[fo-fiI  in  the  input  segment  space,  and  all  to,t,ti-  The  purpose  of  this 
condition  is  to  ensure  that  the  state  space  'it  can  include  all  possible  initial 
conditions  for  (f. 

To  help  clarify  the  meaning  of  the  second  self-consistency  condition  con- 
sider the  input  to  (3  over  (/o-fl  1 as  consisting  of  two  contiguous  segments, 
u®(foM  followed  by  u'(/,fi] , i.e., 

u = 

(Note:  u®u’  is  not  to  be  interpreted  as  the  product  of  u*'  and  u’.)  If  the 
input-output  pair  (u®u’,y®y')  satisfies  (1.6),  then  the  segment  response  to 
u^u'  beginning  in  state  x(^o)  = <0  i* 

yOy'  = F(xo;u“u>)  (1.10) 

To  say  that  (u*,y*)  also  satisfies  (1.6),  as  the  second  self-consistency  condition 
does,  means  that  there  must  be  some  values  of  x in  'I'  such  that 


y>  = F(x;u>) 


for  all  arbitrary  times  r,  where  to<t<t\.  Denoting  these  values  of  x in  'F  as 


mm 
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the  set  A the  second  self-consistency  condition  requires  that  this  set  be  non- 
empty. (Figure  1 .4  graphically  illustrates  the  idea.) 

To  further  illustrate  the  second  self-consistency  condition  let  (3  be  charac- 
terized by  the  relation 

xf)  = + j*  'ua)dk  (1.12) 

J to 

u and  V are  real-valued  time  functions  and  4'  includes  the  set  of  all  real  numbers, 
i.e.,  a = x is  any  real  number.  Let’s  consider  the  value  of  y at  the  particular  time 
I = f.  For  ^ = t equation  (1.12)  becomes 

y(t)  = + f u(t)dt  (1.13) 

Jto 

Let  the  input  u(t)  over  the  observation  interval  [0,5]  be  u(t)  = -3t^  and  let  the 
initial  conditions  at  time  Zq  = 0 be  xq  = cto  = 1 . From  (1.13)  the  response  over 
[0,5]  is 

>'(0  = 1 - 

Thus,  the  input-output  pair  associated  with  xq  = 1 for  the  interval  [0,5]  is 

(-3/2,1- f3). 
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Now  consider  the  intermediate  time  t = 2.  The  second  self-consistency 
condition  requires  that  if  the  input-output  pair  (m(0,5J,>'(0,5]  ),  which  is 
(-3/2,1-  /^),  satisfies  (1.6),  then  the  same  pair  over  the  time  interval  (2,5)  also 
must  satisfy  (1 .6).  Since  ff  is  characterized  by  (1 .12),  we  have  for  [2,5] , 


„r-, 


Also,  from  (1.12)  we  can  write  for  the  pair  (-3/2,1-  /^) 


y{t)  = 1 + 


The  only  way  the  pair  (-  3/2, 1 - /2  ) can  satisfy  (1 . 1 4)  is  if 


a2  = 1 - I 3t^dt 


Clearly,  a2  = -7  has  no  solution  in  where  4^  consists  of  real  numbers.  There- 
fore A is  empty  and  the  second  self-consistency  condition  is  not  satisfied  by 
relation  (1.12). 

On  the  other  hand,  if  (2  were  characterized  by  an  input-output-state  rela- 
tion of  the  form 


Jtr. 


u{k)d% 


instead  of  (1.12),  then  for  any  xq  in  'I'  and  any  | in  [/o,/il,  where  /q  < I < /i. 
we  can  write 


The  pair  («(/,/[  ),y{t,t\ ))  is  an  input-output  pair  with  respect  to 

.«l  = ao  + I uiX)d% 

•’to 


J 
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It  is  seen  from  (1.18)  that  for  any  real-valued  time  function  u in  the  input  space 
the  state  x\  assumes  a real  value  in  given  that  Oq  is  real.  Thus  an  input- 
output-state  relation  of  form  (1.16)  satisfies  the  second  self-consistency 
condition. 


1.10  CONDITION  IV,  THIRD  SELF-CONSISTENCY 

We  defined  an  object  (f  as  a family  of  sets  of  input-output  pairs.  To  complete 
the  parametrization  of  the  input-output  pairs  it  remains  to  examine  the  meaning 
of  the  term  “the  state  of  (f  at  time  t,"  where  t is  arbitrary.  Tliis  is  equivalent  to 
asking  what  happens  to  the  set  A of  values  of  x in  'I'  as  the  input  segment 
u’(r,?||  varies  over  the  input  segment  space  of  Cf.  We  want  to  portray  the 
situation  as  one  where  xq  and  u®  are  held  fixed  and  u’  is  varied  over  the  entire 
input  segment  space. 

From  the  first  and  second  self-consistency  conditions  we  readily  deduce 
that  the  set  A depends  on  xq,  u®  and  u*  (where  the  variable  t is  arbitrarily 
fixed).  We  can  write  for  A 

A = >4(xo;u®u*)  (1.19) 

Now  consider  the  situation  where  u*  varies  over  the  input  segment  space  of  (f, 
i.e.,  u’  assumes  the  variations  U|*  ,U2* , — Accordingly,  the  corresponding  ^4  sets 
become  /4(xo;u9u|’),A(xo;u9u2*), . . . If  we  form  the  intersection  of  all  the  >1 
sets  (denoted  by  the  A'  subset)  this  intersection  (if  not  empty)  is  the  set  of  all 
points  X in  'I'  to  which  every  pair  (u ’ ,y ' ) satisfying 

y‘V  = F(xo;u«u‘)  (1.20) 

also  satisfies 

y>  = F(x;u')  (1.21) 

for  all  X in  the  intersection. 

Based  on  the  above  discussion  we  state  the  third  self-consistency  condition 
as  follows:  Let  (u^u’.yOy* ) be  an  input-output  pair  satisfying  (1 .6)  with  respect 
to  some  XQ,  i.e., 

yOyi  = F(xo;u®u') 

Also,  let  A(xo;u*^u’)  be  the  set  of  all  x where  (u’,y’)  is  an  input-output  pair 
satisfying  (1 .7),  i.e., 

y*  = F(x;u’) 
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Tlie  third  self-consistency  condition  requires  that  the  intersection  of  all 
/4(xo;u'*u*)  taken  over  all  u'(^./il  in  the  input  segment  space  be  nonempty  for 
all  f().  till  *0  tn  and  all  segments  u(fo,/|]  in  the  input  segment  space  of  (f. 

From  tile  way  in  which  the  third  self-consistency  condition  is  formulated 
it  cannot  be  satisfied  unless  all  of  the  ^(xo;u‘*u')  sets  are  nonempty.  This 
implies  that  the  second  self-consistency  conditi  is  also  satisfied.  However,  the 
converse  is  not  true. 

The  meaning  of  the  third  self-consistency  condition  is  readily  demon- 
strated by  example  (1.16).  Let  (f  be  characterized  by  an  input-output  relation  of 
the  form 

Xf)  = -fo  + r (1.22) 

Jfo 

We  establish  the  following  conditions  under  which  the  third  self-consistency 
condition  will  be  evaluated:  fo  “ 0-  intermediate  time  t = 2,  t\>2, 

u{%)  = 2?  for  0 < ^ < r,  and  u(i)  = variable  for  2 < J < ?! . Dividing  the  integra- 
tion range  of  (1 .22)  into  (0,2)  and  [2. a],  where  2 <a  < r,  we  have 

y(a)  = 4 + J 2?c/|  + u(k)d^  2 <a  < t 


which  is  of  the  same  form  as  (1.22).  This  implies  that  (n(2,r|  | . v(2,r]  | ),  with 
>’(2,/)  ] defined  by  (1.23),  satisfies  (1 .22)  with  x = &.  Ttierefore,  the  A sets  as 
defined  by  (1.19),  (1.20)  and  (1.21)  are  nonempty.  These  sets  all  contain  the 
point  X = 8,  and  hence,  so  does  their  intersection.  We  conclude  A is  nonempty 
when  r = 2,  u(^)=2^  for  0<|</,  and  that  relation  (1.22)  satisfies  the  third 
self-consistency  condition. 


1.11  STATE  AT  TIME  f 

For  the  discussion  to  follow  we  denote  the  intersection  of  all  the  A sets  (formed 
by  varying  u*  over  the  entire  input  segment  space)  as  the  A'  subset,  (It  has  been 
said  that  A'  is  the  set  of  all  points  x in  4'  to  which  every  pair  (u',y' ) satisfying 
(1.20)  also  satisfies  (1.21).)  Clearly,  A'  contains  values  of  x where  the  response 
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to  u*  is  the  same  for  all  starting  states  in  A'.  This  is  true  for  all  u’  in  the  input 
segment  space.  Therefore,  the  third  self-consistency  condition  provides  for  the 
following  definition’  of  the  slate  of  (i  at  time  t:  Given  that  Cf  is  characterized 
by  (1.6)  and  the  response  of  (f  to  input  segment  u”u’  starting  in  state  xn  is 
described  by  (1 .20)  and  (1 .21 ),  then  the  state  of  (f  at  time  t is  any  state  x in  the 
a'  subset. 

Symbolically,  the  state  of  (t  at  time  t will  be  denoted  as  x(/),  bearing  in 
mind  that  this  is  merely  a label  to  identify  an  element  of  zl'(xn;u”u’)  with  the 
state  at  time  t.  Similarly,  x(/o)  will  notationally  identify  with  the  initial  state  of 
tf  at  time  fi)-  Accordingly,  the  response  at  time  t can  be  written,  in  accordance 
with  (1 .6),  as 

v(0  = y(x(to);u(fi)-^l ) (1-24) 

We  note  from  (1.24)  that,  for  each  fixed  t.  x(/o)  and  x(f)  range  over  the  entire 
state  space  'J',  i.e.,  /?[x(7)]  = 'T.  Tlie  components  of  x(f)  will  therefore  be  desig- 
nated as  the  state  variables  or  elements  of  the  state  vector  x(t). 

On  considering  an  input  u(/o,/i]  applied  to  (f  while  Cl'  is  in  state  x(to),  it 
follows  that  x(o)  is  the  terminal  slate.  If.  however,  we  have  u(/o,/i]  followed 
by  u(r  1 ,/2 1 .u(^2-f3]»- • ^^ch  of  the  respective  terminal  states 

x(/|),x(7?).x(73) plays  a dual  role;  each  represents  both  tire  terminal  state 

relative  to  input  segment  t and  the  input  state  relative  to  input  segment  r + 1 . 

For  objects  characterized  by  input-output  relations  of  the  form  (1.5),  i.e., 
Cf(u.y)  = 0,  the  standard  practice  is  to  associate  x(r)  with  such  objects  by  defin- 
ing x(r)  in  terms  of  u(?o-^l  attd  ^nd  to  verify  that  the  four  consistency 

conditions  are  satisfied.  In  cases  where  (1.5)  is  a differential  equation  then  x(t) 
can  be  defined  in  terms  of  u(/)  and  y(/)  and  a finite  number  of  their  time 
derivatives.  Other  expressions  for  x(r)  in  terms  of  u(/oT|  and  y(/oT]  can  readily 
be  obtained  by  changing  the  coordinate  system  of  the  state  space.  Importantly, 
there  are  many  ways  to  associate  a state  vector  with  an  object. 

x(0  has  been  defined  to  be  an  element  of  the  set  zl'(x(t())iu^u’).  Since  A' 
depends  only  on  x(?o)  and  u(fo.fl-  then  so  must  x(r).  Thus,  we  can  write  for  the 
state  equation 

x(0  = x(x(to);u(to-tl)  (1-25) 

Under  proper  regularity  assumptions  for  x and  u equations  (1 .24)  and  (1.25)  will 
tend  in  the  limit  to  differential  equations  of  the  form 

x(0  = x(x(f),u(f) u’*’(r),t)  (1.26) 

y(f)  = y(y(/).u(/),...,u'*V/)./)  (1-27) 

I . op.  at. 
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where  x and  y are  point  functions  and  is  tlie  ^th  derivative  of  u(r). 

liquations  (1.26)  and  (1.27)  particularly  apply  to  differential  objects  and  are 
referred  to  as  the  state  equations  in  differential  form.  Differential  objects  and 
their  respective  state  equations  are  of  considerable  interest  in  system  theory  and 
will  be  the  focus  of  discussion  in  later  chapters.  In  particular,  linear  differential 
systems  and  their  state  equations  assume  the  simple  form  of 

x(t)  = A(r)x(r)  + B(f)u(0  (1.28) 

Y(r)  = C(f)x(r)  + D(f)u(r)  + D,(r)u<'>(r)  + ...  + D*(f)u**>(r)  d-29) 
where  the  coefficients  are  (time  dependent)  matrices. 


1.12  MULTIPLE  OBJECTS 

Thus  far  we  have  considered  only  the  single  object  (f . its  terminal  variables  and 
the  relations  between  them.  Out  principal  interest,  however,  is  a collection  of 
objects  that  interact  with  one  another.  We  will  now  focus  on  the  analysis  of 
multiple  objects. 

For  the  most  part  we  shall  use  the  conventional  rectangular  block  to 
graphically  describe  an  abstract  object  (f  (see  Figure  (1.5)).  Here  the  leads 
represent  the  terminal  variables,  which  can  denote  either  scalar  or  vector  varia- 
bles. The  important  thing  is  that  each  variable  is  treated  as  an  entity  when  each 
object  (f  is  interconnected  with  other  objects. 

From  equation  (1 .25),  where 

y(f)  = y(x;u) 

it  is  seen  that  each  component  of  y(f)  is  a function  of  x and  u.  Tlierefore,  we  can 
express  each  component  of  y as 


T,(0  = T/(x;u)  d-30) 

1 

^ This  implies  that  an  object,  as  shown  in  Figure  1.5.  having  C inputs  and  m 

t outputs,  can  be  represented  by  m objects  having  C 

I* 
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inputs  and  one  output  (see  Figure  1 .6).  Thus,  if  the  objects  under  consideration 
are  oriented  there  is  no  loss  in  generality  in  assuming  that  the  composite 
“object”  is  made  up  of  oriented  objects  where  each  has  one  output  variable. 

1.13  INTERCONNECTED  OBJECTS 

Essentially,  a “system”  is  a collection  of  interacting  objects.  The  interactions 
between  objects  (l|,(f2 represent  constraints  on  the  terminal  vari- 

ables. An  example  of  such  a constraint  is  the  fact  that  the  ith  terminal  variable 
of  object  U'y  is  equal  to  the  Ath  terminal  variable  of  object  (ti;,  for  all  r in  T.  The 
set  of  objects  Cfj  ,(l2 constrained  in  this  manner  is  said  to  be  an  inter- 
connection of  (1i.Ct2 (fjV-  Such  an  interconnection  can  be  represented 

graphically  as  shown  in  Figure  1 .7,  where  the  composite  object  is  designated  as 
tf.  Specifically,  the  set  of  objects  {Cf,}  /=  1,2,3 A,  is  said  to  be  an  intercon- 

nection of  the  iii  if  every  object  in  the  set  shares  at  least  one  terminal  variable 
with  one  or  more  of  the  other  objects  of  the  set.  If  some,  or  possibly  none,  of 
the  iii  share  terminal  variables  the  collection  is  partially  interconnected. 

There  are  several  simple  types  of  interconnections  encountered  frequently 
in  systems  analysis;  they  are  the  tandem  combination  and  the  parallel  combina- 
tion. In  the  sequel  we  will  briefly  highlight  both. 


Figure  1.6  KquivalenI  representation. 
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Figure  1.7  Interconnected  objects. 
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1.14  TAIMDEN  COMBINATIONS-INITIALLY  FREE 

For  an  analysis  on  nrultiple  objects  to  have  any  concreteness  it  must  be  able  to 
answer  for  the  composite  object  the  same  questions  that  are  asked  of  the  single 
object:  What  is  the  input-output-state  relationship  of  the  composite  interconnec- 
tion? Wliat  is  the  state-space'*  Wliat  is  the  output  function  space?  At  this  point  it 
must  be  noted  that  answers  to  these  questions,  particularly  for  large  complex 
systems,  have  not  been  readily  established.  In  fact  some  of  the  concepts  sug- 
gested below  are  tentative.  The  ideas  presented  below,  althougli  tentative  and 
sketchy,  are  representative  of  some  progress  in  this  area. 

In  order  to  establish  a basis  for  answering  the  above  questions  we  intro- 
duce the  idea  of  the  miriullv  free  interconnection,  which  will  be  symbolically 
denoted  as  The  connections  between  the  (t,.  as  shown  in  Figure  1.7, 

remain  open  until  t = '’o.  at  which  time  the  Q switches  are  closed.  We  regard  tf  as 
a limiting  form  (as  of  an  initially  free  interconnection  Ci(to).  It  is 

therefore  possible  to  relate  the  properties  of  (!(/)  to  those  of  Ct(/o).  thereby 
giving  a characterization  to  the  state  space  of  t1. 

Consider  a collection  of  oriented  objects  1.1| , (t2,. . . tiAt.  (as  depicted  in 
Figure  1 .7)  where  each  is  characterized  by  a relation  of  the  form 

y'  = y(x';u')  (1.31) 

If  there  is  no  interaction  between  the  various  tt/,  i.e.,  all  the  switches  are  open, 
then  the  collection  is  the  direct  product  of  (fi,Cl2 (f;y,  which  will  be  de- 

noted as  tfi  X (f2  X ...  X Tlie  input  to  each  (J,-  along  with  the  state  of 
each  lij  is  clearly  independent  of  the  inputs,  outputs  and  states  of  any  of  the 
other  objects  in  the  collection.  Accordingly  the  input  and  output  of  the  com- 
posite Q.  are  defined  to  be,  respectively,  the  composite  vectors. 

u = (u*,u^ lA)  (1 .32) 

and 

y = (y',y^ y^)  (1-33) 

The  corresponding  input  and  output  segment  spaces  of  (f  are  the  products, 
respectively. 


7?(u|  = /?(u'|  X /?(u2|  X ...  X (1.34) 
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Similarly,  the  composite  vector 

x(r)  = (x'(r),x^(r) (I.36J 

is  the  state  of  tf  at  time  /,  where  the  state  space  of  Cf  is  the  product  space 

'I'  = xlr'  X X ...  X (1.37) 

Now  let  two  of  the  fl,-,  say  (f|  and  (f2.be  connected  in  tandem,  with  (f| 
preceeding  Cf2,  as  shown  in  Figure  1 .8.  In  a tandem  combination  the  input  to 
(f  2 is  “constrained”  to  be  equal  to  the  output  y*  of  (fi . It  is  necessary  now  to 
distinguish  between  two  situations:  (1)  =y*  for  t>  to,  where  to  is  the  in- 
stant u ’ is  applied  to  (f  1 , and  (2)  = y ’ for  all  t.  Case  (1 ) is  the  initially  free 

tandem  combination,  which  will  be  considered  subsequently.  Case  (2)  is  the 
constrained  tandem  combination  and  will  be  considered  in  the  next  section.  In 
(1)  (f[  and  (f2  act  like  the  direct  product  for  t<  to-  The  switch  Q signifies  that 
is  constrained  to  be  equal  to  y * from  time  to  on,  but  not  before. 

Proceeding  with  case  (1),  where  Q is  closed  at  time  Tq,  we  assume  that  (fi 
and  (f2  are  characterized  by  relations  of  form  (1.31),  with  the  output  segment 
space  of  (fi  contained  in  the  input  segment  space  of  (f2-  Let  the  states  of  ff] 
and  0.2  at  time  t be  (as  deduced  from  (1.31)) 

x'(r)  = x(X(5';u')  (1.38) 

where  X({  = x'(fo)  and  u'  = u'(fo,f].  The  initial  states  Xg’  and  Xg^  can  be  chosen 
arbitrarily  in  'P*  and  'it^.  Therefore  given  Xg’,  Xg^  and  u*(/g,r]  we  can  find,  from 
(1.31)  and  (1.38),  the  output  y^(fo.f].  and  the  states  x'(r)  and  x^(r)  for  any 
t>to-  Thus,  the  free  product  of  the  tandem  combination  (Ji  and  ((2  is  an 
object  (i(/o)  characterized  by  the  input-output-state  relation 

= y[xg2;y(xg';u>)]  (1.39) 

where  the  composite  state  vector 

Xg  = (Xg‘,Xg^)  (1.40) 


Figure  1.8  Tandem  combination  of  11 1 and  II2. 
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plays  the  role  of  the  initial  state  of  Tlie  state  of  Ct(fo)  at  time  t is 

x(r)  = (x'(/),x2(/»  (1.41) 

where  x*(f)  and  x^(/)  are  expressed  by  (1 .38)  as 

x'(r)  = x(x„';u')  (1.42) 

x2(r)  = x(xo^;x(x,/;u'))  (1.43) 

The  corresponding  output  segment  space  for  (f(/o)  is  the  set  of  output  functions 
y2,i.e., 

/?|y2]  = {y2}  (1.44) 

where  is  determined  by  (1 .39). 

Clearly,  equation  (1.39)  satisfies  the  first  self-consistency  condition  of 
Section  1.1 1.  However,  this  alone  is  insufficient  to  qualify  (1.39)  as  the  input- 
output-state  relationship  for  (f  (/q)-  (But  it  will  be  seen  later  that  if  C((/o)  admits 
to  the  response  separation  property,  highlighted  in  Chapter  2,  then  the  com- 
posite state  vector 

x(f)  = (x'(f),x2(f),...,x^(r)) 

for  Mn  qualifies  as  the  state  vector  of  (S(ro)at  time  /.) 

It  is  important  to  note  that  the  composite  state  vector  for  the  free  product 
(?!  and  i.e. 

x(0  = (xHo.x2(0) 

is  unlike  the  initial  state  vector  (xqKxq).  The  initial  state  vector  ranges  over  the 
product  space  qr’  X The  vector  (x*(r),x^(/)),  which  is  the  state  of  (f(ro)  at 
time  t,  ranges  over  a subset  of  '1'^  X This  subset  'f'(ro,f)  is 

'I'(/0.0  = {(x>(/),x2(r))}  (1.45) 

where  xq'  and  Xq^  are  elements  of  and  respectively,  and  the  states  x'(r) 
and  x^(r)  are  given  by  (1 .42)  and  (1 .43),  respectively.  The  arguments  rg  and  r of 
the  subset  serve  to  show  the  subset’s  dependency  on  both  (q  and  /. 

Typically,  the  free  product  of  3 objects  (Jj,  Cf2<  ^*3  >s 

y^  = y[x(^y(xo^;y(xo';u'))l  (1.46) 
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Note  th  :t  (1 .46)  is  associative.  (Cf3(t2)(fi  and  (J3((}2Cfi)both  characterize  the 
same  input-output-state  relation.  Obviously,  (1.46)  can  be  extended  to  N 
objects. 


1.1S  TANDEM  COMBINATIONS-CONSTRAINED 

The  idea  of  an  initially  free  tandem  combination  of  objects  provides  a con- 
venient way  of  describing  the  constrained  combination,  i.e.,  the  combination  of, 
say  (}|  and  (^2,  in  which  the  input  to  (?2  is  constrained  for  all  time  t to  be  the 
output  of  Qi . Such  a combination,  which  will  be  designated  as  the  combination 
tf , is  depicted  in  Figure  1 .8  where  the  switch  Q is  closed  and  n*  is  applied  to  (f| 
afterwards. 

The  constraint  u^(r)  = y'(/)  for  all  t is  regarded  as  the  limiting  form  (as 
fg  ->-oo)  of  the  constraint  u^(/)=  y’(/)  fof  ^>^0-  Th^  implication  here  is  that 
the  state  space  of  (i  is  characterized  by  the  limit,  if  it  exists, 

'!'(/)  = lim  't'Uo,/)  (1.47) 

The  exact  meaning  of  (1.47)  and  the  conditions  under  which  it  exists  are  com- 
plicated and  unsettled  questions.  However,  for  our  needs  we  will  interpret  (1.47) 
to  mean  that  for  each  t we  can  approach  any  point  in  4'(r)  arbitrarily  closely  via 
a point  in  4'(Zo,^).  For  systems  of  interest  (differential  systems)  the  state  at  time 
t is  defined  by  a finite  number  of  derivatives  of  the  input  and  output  at  time  /. 
'!'(/)  can  be  determined  through  the  process  of  elimination  of  identical  variables 
from  expressions  for  states  of  (f)  and  ((2- 

Based  on  the  above  observations  the  combination  (f  can  be  regarded  as  the 
limiting  form  of  the  free  product  (((/q)  as  fg  The  state  space  of  (?  is  a 

subset  'Jr  of  X 4'^.  We  have 

'l'(f)  = lim  {(x>(0.x2(/))}  (1.48) 

to”*-" 

where  xg' , Xq^  and  u’  are  elements  of  W’,  and  /?[u*  ] , respectively.  x*(r)  and 
x^(r)  can  be  determined  by  expressions  (1.42)  and  (1.43),  respectively.  The 
corresponding  input-output-state  relationship  for  (f  is  of  form  (1.39)  and  (1.46), 
with  (xq’,Xq^)  ranging  over 

Clearly,  for  the  objects  (([,((2 the  above  analysis  can  be  ex- 

panded as  appropriate. 
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1.16  SYSTEM 

We  now  return  to  our  definition*  of  a system  as  given  in  Section  1 .2.  A system  S 
is  a partially  interconnected  set  of  objects,  called  components.  These  com- 
ponents may  be  oriented  or  nonoriented;  they  may  be  finite  or  infinite  in 
number;  and  each  may  be  associated  with  a finite  or  infinite  number  of  terminal 
variables. 

A system  may  be  regarded  as  a single  object.  Conversely,  any  single  object 
may  be  regarded  as  a system.  If  a collection  of  objects  is  con- 

sidered as  a single  system  the  set  of  terminal  variables  of  the  system  S is  the 
union  of  the  terminal  variables  of  its  components.  If  the  components  of  S are 
oriented,  then  so  is  S.  The  aggregate  of  shared  terminal  variables  and  the  non- 
shared  output  variables  of  the  various  components  together  comprise  the  output 
variables  of  S. 


1,  op.  dr.,  p.  65. 
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State  Equations 


2.1  INTRODUCTION 

The  intent  of  Chapter  1 was  a more  careful  definition  of  a system  than  is  found 
in  the  texts  of  physics,  mechanics  and  control  theory.  The  building  block  of  the 
defmtition  was  the  oriented  object,  i.e.,  the  sets  of  ordered  input-output  pairs. 
The  notion  of  state  allows  for  labeling  the  pairs,  or  the  parametrization  of  the 
space  of  input-output  pairs.  The  discussions  of  this  chapter  will  expand  on  the 
idea  of  system  state.  Specifically,  the  fundamental  properties  of  the  self  con- 
sistency conditions  will  be  examined;  the  most  important  being  the  state  separa- 
tion property.  This  property  leads  directly  to  an  input-output-state  relation,  in 
canonical  form,  for  linear  differential  systems.  The  definition  of  state,  as  formu- 
lated in  Chapters  1 and  2 is  of  general  applicability.  It  provides  a basis  for 
defining  the  states  of  complex  systems  comprised  of  both  linear  and  nonlinear 
components. 

2.2  SEPARATION  PROPERTY 

In  Sections  1.7  through  1.11  we  defined  x(f).  the  state  of  Cf  at  time  t,  where 
/>/ri'  as  an  element  of  the  intersecting  A sets  /l(x();u'’u' ).  Tliese  sets  are 


25 


26 


AN  INTRODUCTION  TO  THE  THEORY  OF  LINEAR  SYSTEMS 


created  at  fixed  t by  varying  u*  over  the  input  segment  space.  By  its  definition  A 
is  the  set  of  all  x satisfying  the  relation 

v(xo;u"u')  = y(x();u")y(x;u‘)  (2.1) 


or 


y(x(f,,);u<'u')  = y(x(/o);u«)y(x(r);u')  (2.2) 

for  all  x(ro).  u**  and  u*.  Relations  (2.1)  and  (2.2)  are  interpreted  to  mean  that 
given  an  input  consisting  of  segment  u*’  followed  by  segment  u'  the  response  of 
(f  starting  in  state  x(fo)  response  segment  y(x(ro);u*’) /oZ/mver/  by  response 
segment  y(x(f);u’),  where  x(r)  is  the  state  of  Ci  at  time  t > Iq.  This  property  of 
x(r)  is  referred  to  as  the  separation  property. 

By  specifically  designating  Xq’  as  the  terminal  state  into  which  xo  is  taken 
by  u®  equation  (2.1 ) becomes 

y(xo;u^u')  = y(x(,;u'’)y(xo';u')  (2.3) 


Equations  (2.1),  (2.2)  and  (2.3)  can  be  rewritten  as  an  expression  fory(t) 
rather  than  the  response  segment  y(ro>t)-  If  we  let  the  variable  ^ be  the  inter- 
mediate time  between  tQ  and  /]  we  have  (in  the  limit)  the  identity  correspond- 
ing to  (2.3),  i.e., 

v(x(<o);w(io.?l)  = y(x(f);u(|,/J)  to  < ^ < t (2.4) 

The  expression  y(x(|);u(|,f])  is  the  .esponse  of  (?  at  time  t to  u(|,r],  with  (3 
initially  in  x(J)  and  where  /o  < I < T 

The  analyses  of  (2.1)-(2.4)  places  in  evidence  a distinct  connection  be- 
tween the  self-consistency  conditions  and  the  separation  property.  The  relevant 
facts  are  summarized  as  follows.  If  a relation  of  the  general  form 

y(f)  = y(x;u)  (2.5) 

has  the  response  separation  property  (for  all  x and  u), 

y(xo;u(to./|)  = y(xo‘;u(trl)  to  < ^ < t (2.6) 

(where  Xq  depends  only  on  xo  and  u)  then  we  assert  that  (2.5)  satisfies  the 

three  self  consistency  conditions,  with  xo  and  being  the  states  of  (f  at  time 

io  and  respectively.  Further,  equation  (2.5)  qualifies  as  an  input-output-state 

relationship. 
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Clearly,  the  first  self-consistency  condition  is  satisfied  since  y(7)  is 
uniquely  determined  by  xq  and  u.  The  separation  property  as  stated  by  (2.6)  is, 
in  fact,  a statement  of  the  second  self-consistency  condition  (see  Section  1.12). 
By  adding  the  qualification  that  Xq’  depends  only  on  xq  and  u,  then  every  set 
/l(xo;u*'u')  will  contain  x^.  Therefore  the  intersection  of  the  A sets  formed  by 
varying  over  the  input  segment  space  will  be  nonempty.  Thus,  the  third 
self-consistency  condition  is  satisfied.  The  importance  of  the  separation  property 
is  that  for  a given  input-output-state  relation  of  form  (2.5)  verification  of  the 
self-consistency  conditions  is  reduced  to  demonstrating  that  (2.5)  has  separation 
property  (2.6). 

The  separation  property  (2.4)  can  logically  be  extended  to  include  the 
state  of  fl.  Given  that  the  input-output-state  relation  is  of  form  (2.5),  i.e., 

v(0  = v(x(/o);u(fo.^l) 

the  state  equation  induced  by  (2.5)  is 

x(t)  = x(x(to)Mtod])  (2.7) 

In  keeping  with  (2.4)  we  assert  that  state  equation  (2.7)  has  the  state  separation 
property 

x(x(to);u(ro;f])  - x(x(|);u(J,r))  to  < ^ < t (2.8) 

for  all  x(?o)  and  This  property  of  state  equation  (2.7)  is  one  of  its  key 
characteristics. 

To  illustrate  both  the  response  and  state  separation  properties  of  (2.4)  and 
(2.8)  let  ft  be  characterized  by  the  input-output  state  relation 

y(t)  = y(x(/o);«(toT|) 

= x(to)e~^'~^o^  + f (2.9) 

*''0 

The  state  equation  induced  by  (2.9)  is 

x(t)  = x(ro)e'‘'“'‘>’  + f (2.10) 
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Specifying  the  integration  variable  as  t,  where  /()  <t</,  equation  (2.9) can  be 
written  as 


y(/)  = + r + f (2.1 

•^'0 

However,  from  (2.10) 

x(t)  = + r e-<'-«>u(J)c/J 

Thus,  response  (2.9)  becomes 

y(t)  = x(T)e-^'-’''  + J 


1) 


(2.12) 


= y(x(T):u(T,0)  to  < T ^ t 


(2.13) 


which  illustrates  the  response  separation  property.  On  returning  to  (2.10)  the 
state  separation  property  (2.8)  assumes  the  form 

jc(0  = x(r)e~^'~'o^  + J e~^'~^^u(^)d^ 


= x(x(T);u(T.t)) 


(2.14) 


2.3  STATE  EQUATIONS 

The  ideas  of  Sections  2.2  can  be  combined  into  the  following  THEOREM  2.1 : 

If  f?  is  characterized  by  relations  of  the  form 

x(t)  = x(x(r());u(/oT|)  t > to  (2.15) 

y(f)  = v(x(r);u(r),r)  (2.16) 

where  x(r)  has  the  state  separation  property  (2.8),  then  x(r)  can  be  the  state  of 


..4.C. 
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Li  at  time  t,  and  (2.15)  and  (2.16)  the  sta/e  equations  of  Ct.  Proof  of  this 
theorem  is  seen  by  substituting  (2.1  5)  into  (2.16)  giving 

V(r)  = y|x(x(/o);u(t()./l).u(t).t|  (2.17) 

which  is  of  form  (2.5),  i.e., 

y(r)  = y(x(to);u(to./|)  (2.18) 

By  virtue  of  its  state  separation  property  equation  (2.15)  can  be  rewritten  as 

x(x(ro);u(t,).f|)  ==  x(x«);u(«.r|)  (2.19) 

On  substituting  (2.19)  into  (2.17)  we  have 

y(r)  = y[x(x(?);ua,r]),u(r),f]  (2.20) 

which  is  of  the  form 

y(r)  = y(x(|);u(?,r])  (2.21) 

However,  (2.21)  can  also  be  derived  directly  from  (2.1 7)  by  replacing  /q  with 
Thus, 

y(x(fo);u(toT])  = y(x(^);u(tr|)  (2.22) 

and  Cf  has  response  separation  property  (2.4).  Further,  since  x(^)is  determined 
by  x(/())  and  u(roT],  it  follows  that:  (2.18)  is  an  input-output-state  relationship 
for  (f.  with  x(r)  being  the  state  of  (f  at  time  r;  equations  (2.15)  and  (2.16)  are 
the  state  equations. 

Theorem  2.1  offers  further  insight  into  the  relationship  between  the  state 
separation  property  and  the  self-consistency  conditions;  it  provides  an  effective 
way  of  verifying  that  the  self-consistency  conditions  are  satisfied.  An  immediate 
inference  from  the  theorem  is  seen  in  the  following  COROLLARY:  If  ({  is 
characterized  by  differential  equations  o(  canonical  form 

x(f)  = x(x(/),u(r),r)  (2.23) 

y(r)  = y(x(r),u(/),/)  (2.24) 

where  (2.23)  has  a unique  solution  for  x(r),  then  x(/)  is  the  state  of  (f  at  time  t. 
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To  prove  this  corollary  we  first  integrate  (2.23)  between  the  limits  to  and  /: 

x(/)  = x(ro)  + f x(x(?),u(^),?)r/J  (2.25) 

''to 

which  is  an  implicit  form  of  (2.15).  (By  hypothesis  x(r)  is  determined  by  x(ro) 
and  U(ro,r|  .)  Equation  (2.25)  has  the  separation  property 

t 

x(x(t),u(?).?)J? 


The  conclusion  of  the  proof  of  this  corollary  follows  from  the  parent  theorem. 

State  equations  (2.23)  and  (2.24)  are  quite  general  and  cannot  be  readily 
used  in  their  present  form.  By  casting  them  in  a more  explicit  form,  which  is 
applicable  only  to  linear  systems,  we  can  state  that  if  Li  is  characterized  by 
linear  differential  state  equations 

x(f)  = A(/)x(r)  + B(r)u(r)  (2.26) 

y(r)  = C(r)x(r)  + D(r)u(r)  (2.27) 

where  A(f)...D(/)  are  time-dependent  (but  need  not  be)  matrices  then  x(r) 
qualifies  as  the  state  of  (}  at  time  t.  Equations  (2.26)  and  (2.27)  are  the  canoni- 
cal state  equations  of  the  linear  system  (f.  They  are  directly  deducible’  from 
(2.23)  and  (2.24),  respectively. 


3 
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3.1  INTRODUCTION 

To  further  solidify  the  ideas  developed  thus  far  we  will  establish,  by  definition, 
certain  “benchmarks”  relative  to  which  analytic  discussion  can  be  more  mean- 
ingful. In  particular,  these  benchmarks  will  include  definitions  of  the  zero, 
ground  and  equilibrium  states. 

The  zero  state,  say  state  6.  is  defined  as  that  state  where,  for  all  /q,  the 
system  response  to  zero  input,  starting  in  state  6,  is  a z,ero-valued  output  (null 
function).  Symbolically,  if,  for  all  Zq- 

V(0  = y(0;O)  = 0 (3.1) 

where  x(t())  = 6,  then  6 is  the  zero  state.  Tlie  zero  state  response  of  the  system  at 
time  t to  input  u is  the  response  with  the  system  initially  in  its  zero  state  6;  i.e.. 


y(r)  = y(0;u) 

= y(u)  (3.2) 
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The  idea  of  the  ground  state  coincides  witli  that  of  the  zero  state.  Consider 
a system  at  time  /o  ihs  initial  state  K(rn)=  Xn  and  let  the  applied  input  be 
zero-valued,  i.e.,  u = 0.  We  want  to  examine  x(t)  as  / Tliere  are  two  possi- 
bilities: (I)  x(r)  may  converge  to  a fixed  state,  say  xj,  or  (2)  x(t)  may  not 
converge.  We  will  restrict  our  attention  to  (1 ).  In  case  (1 ) two  possibilities  again 
exist  : (a)  X[  depends  on  X|),  or  (b)  xj  is  independent  of  X||.  The  ground  state  of 
interest  is  condition  1(b);  it  is  said  to  be  the  limiting  terminal  state  into  wliich 
the  system  eventually  settles  when  no  input  is  applied.  Symbolically,  to  say  state 
X|  is  the  ground  state  implies  that 


X|  = linix(x(i;0)  (3..^) 

/ -TOO 

where  xj  is  independent  of  X().  The  limit  (3.3)  is  unique.  Accordingly,  if  X| 
exists  it  also  is  unique.  In  conjunction  with  (3.2)  the  ground  state  response  of 
the  system  has  the  same  meaning  as  the  zero  state  response,  with  the  exception 
that  the  initial  state  is  xq  instead  of  0.  At  time  t the  ground  state  response  of  the 
system  to  input  u is 


v(0  = V(x();u) 


(3.4) 


In  addition  to  the  zero  and  ground  states  it  is  meaningful  to  establish  the 
identity  of  the  equilibrium  state.  Essentially,  a state,  say  6,  is  called  an  equilib- 
rium state  if  it  does  not  change  for  zero  input.  Symbolically,  to  say  6 is  the 
equilibrium  state  is  to  say  that 

B = x(B:0)  t > to  (3.5) 

where  x(B,0)  is  the  system  state  at  time  t > to,  given  that  the  state  at  time  to  is  0 
when  the  zero-valued  input  is  applied. 


3.2  TIME  INVARIANCE 

In  the  sequel  the  specific  systems  of  interest  will  be  those  that  are  time-invariant 
and  linear.  Time  invariance  can  be  conveniently  discussed  in  terms  of  a time 
translation  operator  T^.  The  action  of  the  operator  on  an  operand  is  to  shift  the 
operand  by  a fixed  amount  of  time  A along  the  time  axis.  Positive  and  negative 
A correspond,  respectively,  to  a delay  or  increase  of  A units.  For  A = 0, 
corresponds  to  the  identity  operator  I.  This  action  on  input  u is  depicted  in 
Figure  3.1 . 

Consider  a system  in  its  initial  stale  XQ.but  not  necessarily  the  zero  state  0. 
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Iq  I Iq*  ^ ^ 


Figure  3.1  Time  translate  operalor 


Tlie  system  is  said  to  be  time  invariant  with  respect  to  the  initial  state  xq  if, 
for  all  starting  states  xq,  all  inputs  u and  all  time  shifts  A, 

T^[v(x();u)]  = y(xo;T^u)  (3.6) 

Expression  (3.6)  is  interpreted  to  mean  that  the  response  to  T^u  with  the 
system  initially  in  xg  is  equal  to  the  shifted  response  Ta[v(xo;u)]  with  the 
system  initially  in  xg. 

As  special  cases  of  (3.6)  we  examine  zero  state  and  zero-input  time  invari- 
ance. A system  is  said  to  be  zero  state  time  invariant  if  for  all  inputs  u and  time 
shifts  ±A  the  zero-state  response  to  u is  a time  translate  of  its  zero-state  response 
to  the  translated  input,  i.e.,  for  all  A and  u equation  (3.6)  becomes 

v(TA(d;u))  = Ta(v(0;u)) 
v(Ta(u))  = TaV(u)  (3.7) 

Similarly,  a system  is  said  to  be  zero  input  time  invariant  if  for  all  initial 
states  xg,  all  initial  times  fg  and  all  time  shifts  A,  the  zero  input  response  to  u 
starting  in  state  xg  at  time  /g  ± A is  identical  to  the  zero  input  response  starting 
in  state  xg  at  time  fg.  This  implies  that  for  all  A and  x(fg),  equation  (3.6) 
becomes 

v(x(fg±A);0)  = TAy(x(fg);0)  (3.8) 


3.3  LINEARITY 

The  term  linearity  suggests  proportionality.  Of  interest  will  be  the  proportion- 
ality among  system  quantities;  specifically,  their  homogeneous  and  additive 
properties.  A system  is  said  to  be  linear  if  it  is  both  homogeneous  and  additive. 
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It  is  lioinogeneous  if  for  all  inputs  u the  zero  state  response  to  an  input  Cu  is  C 
times  the  zero  state  response  to  u,  where  C is  a constant.  We  have 

y(0;tu)  = CV(0;u) 

y(C’u)  = Cy(u)  (3.9) 

where  y(u)  is  the  zero  state  response  to  u.  Similarly,  a system  is  additive  if  for 
any  pair  (or  more)  of  inputs,  u’  and  u^,  the  zero  state  response  to  u'  + is  the 
sum  of  the  zero  state  responses  to  u*  and  u^: 

y(0;u'  +u^)  = y(0;u’)  + y(0;u^) 

y(u'  + u^)  = y(u')  + y(u^)  (3.10) 

On  combining  the  ideas  of  (3.9)  and  (3.10)  a system  is  said  to  be  zero  stale 
linear  if  and  only  if  it  is  both  homogeneous  and  additive.  Therefore, 

y(0;C(u’ + u^))  = Cy(0;u’)  + Cy(0;u^) 

y(C’(u'  + u-))  = Cy(u')  + Cy(u^)  (3.11) 

If  the  initial  state  is  xq,  then  the  system  is  linear  with  respect  to  the  initial  state 
X,)  if 

y(0;C(u' + u^))  = Cy(xo;u')  + Cy(xo;u2)  (3.12) 

holds  for  all  real  constants  C and  all  u in  the  input  segment  space. 

As  a further  consequence  of  (3.1 1 ) and  (3.12)  it  can  readily  be  shown  that 
if  a system  is  zero  state  linear  then  it  is  also  linear  with  respect  to  all  init>al  states 
reachable  from  the  zero  state.  However,  it  is  not  linear  with  respect  to  all 
possible  initial  states.  To  clarify  what  is  meant  by  linearity  w'e  add  the  property 
of  zero  input  linearity.  A system  is  zero  input  linear  if  its  zero  input  response  is  a 
homogeneous  and  additive  function  of  the  initial  states,  i.e.. 

y(Cxn;0)  = Cy(xn;0)  (3.13) 

y(x„'  +x„2;0)  = y(x„';0)  + y(x„2;0)  (3.14) 

for  all  X()  (i.e.,  Xq’.Xq^,.  . .)  in  the  state  space  4'. 

Equations  (3.13)  and  (3.14)  can  be  combined  giving 

y(ax„'  + x^y.O)  = Cy(xn'  ;0)  + Cy(x^^-O)  (3.1 5) 
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Clearly,  from  (3.12)  and  (3.15)  a general  definition  of  a linear  system,  which 
includes  both  /.ero  state  and  zero  input  linearity,  can  be  made  as  follows;  A 
system  is  linear  if  and  only  if 


(1 ) it  is  linear  with  respect  to  all  possible  initial  states,  i.e., 

v(0;O(u‘ +u^))  = Cy(xo;u')  + Cy(xo;u^) 

(2)  it  is  zero  input  linear,  i.e., 

y(axo' +Xo^);0)  = Cy(x„';0)  + Cy{x,?;0) 

As  an  adjunct  to  the  above  definition  another  basic  property  for  linear 
systems  can  be  deduced  from  (3.15).  Letting  C=  1 and  =0  we  have,  using 
(3.2), 

y(x();u)  = y(xo;0)  + y(0;u)  (3.16) 

The  system  response  to  u starting  in  state  xp  is  equal  to  the  zero  input 
response  starting  in  state  xq  plus  the  zero  state  response  to  u.  Property  (3.16)  is 
known  as  the  decomposition  property.  Since  this  property  was  derived  from 
(3.15)  it  follows  then  that  every  linear  system  has  the  decomposition  property. 
However,  the  converse  is  not  necessarily  true.  We  conclude  the  discussion  on 
linearity  by  stating  that  every  linear  system  must  he  zero  state  linear,  zero  input 
linear  and  must  possess  the  decomposition  property. 

Equation  (3.16)  is  another  form  of  our  initial  input-output-state  equation 
(1.6).  The  importance  of  linearity  is  to  provide  a relative  ease  in  determining  the 
system  response  to  a given  u.  Througli  the  decomposition  property  the  effect  of 
u is  separated  from  the  initial  excitation  as  represented  by  y(x(|.0).  As  will  be 
seen  later,  the  zero  state  response  y(0;u)  can  be  reduced  to  resolving  u into 
simpler  components  and  determining  the  zero  state  response  to  each  component 
separately. 


3.4  ZERO  STATE  OR  IMPULSE  RESPONSE 

By  (3.16)  we  established  that  the  input-output-state  relation  for  any  linear 
system  admits  to  a representation  of  the  form 

y(/)  = y(xo'.O)  + y(0;u)  (3.16) 

Focusing  our  attention  on  the  zero  state  response,  without  loss  ol  general- 
ity we  let  0 = X()  and  u = 6(t-  ?),  a unit  impulse  (delta  function).  We  chose 
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u = 6(r-  ?)  since  any  arbitrary  signal  u can  be  reduced  to  a series  of  elementary 
functions,  such  as  an  impulse  (see  Appendices  A and  B).  Therefore  the  problem 
of  determining  the  response  y(0;u)  to  an  arbitrary  input  is  reduced  to  finding  the 
system  response  to  the  elementary  signal.  The  second  term  on  the  right  in  (3.16) 
becomes 


or  in  scalar  form* 


v(0;u)  = y(u)  = y(6(f-  ?)) 


y(0  = >’(«(/ - 


A fixed  (time  invariant)  system  can  be  identified  with  an  operator,  say  //, 
wherein  the  system  is  entirely  characterized  by  its  response  at  time  f to  a simple 
impulse  8(r-  |)  applied  at  any  instant  of  time  Accordingly,  we  can  write  for 
(3.18) 

y{t)  = >'(u)  = //(u(f)l  (3.19) 

= //|6(r  - 1)1  (3.20) 


where  fr(/,|)  is  the  zero  state  response  at  time  / to  a unit  impulse  6(/  - f)  applied 
at  time  Accordingly,  /i(/,?)  is  also  called  the  impulse  response.  From  (B.4). 
however. 


M(/)  = tz«)6(r- ?)r/J 


Therefore,  substituting  (3.22)  in  (3.19)  we  have 


which,  due  to  the  homogeneous  and  additive  properties  of  linear  systems,  can  be 
written  as 


1.  l*or  convenience  in  the  analysis  wc  will  first  examine  the  input*outpui*statc  equations  in 
scalar  form. 


r ^ 
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^(r)  = r H[8(t-^)]ua)d^ 

oo 


= f h(t,^)u(k)d^  (3.24) 

— oo 

Integral  (3.24)  can  be  viewed  as  a summation  of  responses  to  impulses  applied  at 
time  ^ of  strength  «(?)c/?,  with  ^ varying  over  the  time  internal  (-°°,°°).  As  a 
result  it  is  often  referred  to  as  the  superposition  integral. 

To  qualify  the  zero  state  response  h(t,^)  as  time  invariant  it  must  be  of  the 
form  h{t  - I).  This  fact  becomes  evident  by  writing  (3.24)  as 

1 

y(t)  = f h(t-k)m<i^  (3.25) 

Shifting  the  input  u(J)  in  time  by  A units  the  output  at  time  f + A becomes 

Jt+is 

/t(/  + A- ?)«(?- A)rf? 

<0+A 

We  see  that  the  zero  state  response  at  time  r + A is  equal  in  magnitude  to  the 
zero  state  response  at  time  t,  which  satisfies  condition  (3.7).  Hence,  we  assert 
that  a linear  system  is  zero  state  time  invariant  if  its  impulse  response  h(t,^)  is  of 
the  form  h{t  - ?).  Therefore  a system  obeying  the  input-output  relation  (3.19)  is 
time  invariant  if 

y(f-?)  = //[«(/-?)]  (3.26) 

For  multidimensional  inputs,  where 

t 

' 

u(t)  = 2-.Ui(t) 

I each  component  W((0  may  be  expressed  as 

Ui(t)  = I u,(l)6(/-  k)di  (3.27) 
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Tile  response >'/(/)  = v,(i<)  resulting  from  input  Uj(t)  is 


v,(/)  = I 


i = 


where  is  the  response  at  output  terminal  i occurring  at  time  t due  to  a 

unit  impulse  applied  to  input  terminal  / at  time  The  resultant  output  K/(/)at 
the  ;th  terminal  due  to  all  inputs / = 1 ,2,..., A:,  is 


k oo 


In  matrix  from  (3.29)  becomes 


y(r)  = v(u)  = 


where  H is  the  matrix  of  all  zero  state  responses  at  time  / due  to  all  the  unit 
impulses  applied  at  time 

It  is  often  analytically  convenient  to  express  h(t,^)  in  the  frequency 
domain,  i.e.,  as  a transfer  function.  Accordingly,  the  Laplace  transform  is  the 
analytic  tool.  For  systems  where  the  impulse  response  is  h(t  - |)  the  transfer 
function  H(s),  where  s is  the  complex  frequency,  is  defined  as  the  Laplace 
transform  of  hit): 


H(s)  = £{//(/)} 


h{t)e-^<dt 


j h{t-  k)e~^'‘‘'^'*dt 


3.5  ZERO  INPUT  RESPONSE  AND  BASIS  FUNCTIONS 

We  next  examine  the  first  term  on  the  right  in  (3.16),  i.e.,  the  zero  input 
response.  Reflecting  on  (3.5)  we  see  that  the  zero  input  response  is  synonomous 
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to  the  (equUibrium)  state  at  time  t beginning  in  xq  at  time  /q  when  the  zero- 
valued input  is  applied.  Our  goal  will  be  to  more  explicitly  define  y{xo;0)  in 
terms  of  the  basis  vectors  which  span  the  state  space  In  doing  so,  however,  we 
will  revert  to  scalar  quantities  to  make  the  analysis  more  convenient. 

Let  the  linear  vector  space  defining  the  vector  y(xo;0)  be  spanned  by  the  k 

basis  vectors  a/  (i  = 1 ,2 k),  where  the  scalar  product  of  any  two  unit  vectors 

a,  and  a/  obeys  the  relation 

a,- -a/  = <a,la/>  = 6,y  (3.32) 

The  vector  x(/)  can  be  represented  as 

k 

x(/)  = I2a,x,(t)  (3.33) 

1 = I 

The  scalar  functional  of  the  vector  x(/)can  be  written  as  the  linear  combination 

k 

F(x(t))  = H^jXj  (3.34) 

1=1 

where  the  ip,-  are  scalar  constants  or  functions.  Consequently,  the  scalar-valued 
zero  input  response  must  be  of  the  form 

* 

>'(xo;0)  = Z]ip,(/o-0^,(fo) 

1 = 1 


where 


*0  = ixoKx^,---,x^) 

and  (/■  = 1,2 ,k),  play  the  role  of  constants  for  fixed  t and  to- 

The  have  a simple  interpretation.  They  form  the  elements  of  a k- 
dimensional  vector  0,  where 


k 

0 = HanPjUo.t) 

1 = 1 


- (i^i(fij-0.y’2(^)’^) V’t(fo-O) 


(3.36) 
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and  correspond  to  the  a,-  of  (3.33).  Thus,  we  can  write 

j’(a,;0)  = v’,(/o.O  i = 1.2 k (3.37) 

that  is,  is  the  zero  input  response  starting  in  state  a,  at  time  /q- 

Since,  by  definition,  the  a,  are  linearly  independent,  we  demand  that  the 
functions  also  be  linearly  independent.  They,  therefore,  constitute  a set  of 
basis  functions  for  the  system.  The  vector  <t>  defined  by  (3.36)  is  the  basis 
function  vector  for  the  system.  Accordingly,  the  scalar-valued  zero  input  re- 
sponse (3.35)  can  also  be  written  as 

y(x„;0)  = <0|xn>  (3.38) 


3.6  INPUT-OUTPUT-STATE  RELATION  AND  BASIS  FUNCTIONS 

Returning  to  (3.16)  the  scalar  form  of  the  input-output-state  relation  for  a 
linear,  time  invariant  system  admits  to 

y{t)  = (0lxo>  + r h{t-%)u{X)d\  (3.39) 

*''0 

As  such  (3.39)  satisfies  the  self  consistency  conditions  of  Sections  1.11,  1.12 
and  1.13.  Equivalently,  it  has  the  separation  properties  of  Sections  2.2  and  2.3. 

The  choice  of  basis  functions  v?,-  which  satisfy  (3.39)  is  constrained,  since 
they  must  induce  a relationship  between  themselves  and  the  impulse  response 
h{t  - ^).  We  shall  now  embark  on  examining  these  constraints  more  closely  for 
the  purpose  of  deducing  state  equations  in  a more  explicit  form. 

We  begin  by  assuming  that  the  basis  functions  are  infinitely  differ- 
entiable and  that  the  derivatives  are  continuous  over  the  (finite)  observation 
time  of  interest.  The  response  separation  property  of  Sections  2.2  and  2.3  spe- 
cify that  in  the  time  interval  ro  < t < r the  zero-input  response  at  time  t starting 
with  state  x(/q)  niust  be  the  same  as  the  response  at  time  t starting  in  state  x(r). 
Using  this  property  the  basis  functions  must  therefore  satisfy  the  (time  transla- 
tional) requirement 

<0(/-  rolIxC^o))  = (3.40) 

for  all  /o,  t and  r.  Clearly,  on  letting  to=0  and  x(0i)  = a|,  equation  (3.40) 
reduces  to  (for  an  «-dimensional  vector) 


t 
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n 

T)Xj(T)  (3.41) 

1 = 1 

Hquution  (3.41 ) implies  that  each  basis  function  can  be  represented  as  a linear 
combination  of  the  set  of  delayed  bases  functions  ip,(/  - t),  (i  = 1,2, . . .,n).  This 
is  the  translational  property  of  basis  functions.  On  differentiating  (3.40)  n - 1 
times  we  have  the  following  n equations: 

<0(a-i)(,_  ,p)|x(ro)>  = 7)|x(t)>  (3.42) 


The  superscript  signifies  the  differentiation.  Equations  (3.42)  can  be  written  in 
compact  matrix  form  as 

<!>(/-  fo)>«(^o)  = 4>(r-  r)x(r)  (3.43) 

where  <t>  is  an  n X n matrix.  The  rows  of  <I>  are  the  vectors  0,0^*^ 


(p,(0  <P2(t)  V’„(0 

•••  ^„<"-»(r) 


By  setting  / = t in  the  right-hand  side  of  (3.43)  the  relationship  between  x(r)  and 
x(fo)  is  established  as 

<K0)x(t)  = <I>{t- foW^o)  (3-45) 

If  <l>(0)  is  nonsingular  then  x(t)  is  determined  uniquely.  The  basis  functions 
can  be  normalized  giving  4>(0)=  I,  where  I is  the  identity  matrix. 
The  normalized  set  of  basis  functions  is  found  through  the  transformation 

0'  = 0$'*(O)  (3.46) 

Equation  (3.45)  thereby  reduces  to 

x(t)  = 4>(f  - /o)x('o)  (3.47) 


42 


AN  INTRODUCTION  TO  THE  THEORY  OF  LINEAR  SYSTEMS 


We  see  from  (3.47)  that  <!>(/-  to)  is  an  operator  which  transforms  the  initial 
state  x(/())  into  the  state  x(/).  Hence,  4>(r  - to)  referred  to  as  the  state  transi- 
tion matrix. 


3.7  STATE  EQUATIONS 

In  the  preceding  discussion  the  separation  property  of  the  zero  input  response 
was  used  to  establish  the  basis  function  (derivatives)  composition  of  the  state 
transition  operator  <l>.  We  will  now  consider  the  more  general  situation  of  a 
non-zero  input  and  the  relations  between  the  state  transition  matrix  (basis  func- 
tion) and  the  impulse  response  h(t.^). 

In  the  general  case  the  response  separation  property  of  Sections  2.2  and 
2.3  imply  that  the  input-output-state  equation  (3.39)  must  satisfy  the  time 
translation  identity 

<0(r-  /o)l*(fo)>  + f ¥t- 

'^'0 

= <0(r- t)|x(t)>  r //(r  - J)iv(S)t/|  (3.48) 

for  all  /O'  > ^0  and  all  r,  where  to<:T<t.  (Note  that  by  letting  u(t)  = 6(r) 

and  setting  x(/o)  = 0,  to  = 0,  equation  (3.48)  yields 

h(t)  = <0(f- t)1x(t)>  , (3.49) 

This  shows  that  the  impulse  response  is  a linear  combination  of  the  basis  func- 
tions.) Differentiating  (3.48)  with  respect  to  r gives 


ft(t-  t)u(t)  = t)1x(t)> 


= -<0(r-  t)|x(t)>  + <0(t-  t)|x(t)>  (3.50) 
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Now  dilTerentiating  (3.50)  n - 1 times  with  respect  to  t results  in  n expressions 
as  follows: 


/j(a  '>(r  - t)|x(t)> 


= t)Ix(t)>  + <0'“  "(/- t)I)((t)>  (3.51) 


where  a=  1,2, ...,n,  denotes  the  order  of  the  time  derivatives.  The  set  of  equa- 
tion (3.51)  can  be  represented  more  compactly  in  matrix  form  as 


h(f-T)u(T)=  ^l<l>(r- t)x(t)1 


(3.52) 


where  4>  is  the  state  transition  matrix  defined  in  (3.44)  and  h is  the  column 
vector 


hit) 


h(f)  = 


f/(»(0 


(3.53) 


Integrating  (3.52)  between  the  limits  to  and  t gives 


f} 

'o 


h(r  - t)u(t)c/t 


- t)x(t) 


= <P(t-  r)x(T)l'^ 

= 4>(0)x(r)  - 4>(r-  fo)x('o) 

Making  use  of  the  fact  that  4>(0)  = I the  above  integration  yields  the  state  equa- 
tion 


x(r)  = <!>(/-  to)x(to)  + r h(/-  |)!v(^)t/J  (3.54) 

■^'0 

Equation  (3.54)  is  the  state  equation  in  explicit  form  for  a linear  time-invariant 
system.  It  expresses  the  system  state  at  time  r as  a function  of  the  initial  state  at 


I 


i 


! 


i 
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( 

L 


r 

f 


time  f()  and  the  input  over  the  interval  (to-fl-  For  zero  input  (3.54)  reduces  to 
(3.47). 

Earlier  we  established  (3.39)  as  the  scalar-valued  input-output-state  rela- 
tionship for  a linear  system.  On  differentiating  (3.39)  n - I times  with  respect  to 
t we  have 

y(t)  = <0(r-  ro)lx(to)>  + f h(t-  ^)u(^)d^ 

'^'0 


y(”-»(/)  = /o)lx(/o)>  + f ?)u(l)r/? 

Comparing  the  results  with  (3.54)  where 

x(t)  = <J)(f-  fo)x(fo)  + f h(r-  J)n(?)c/? 

'^<0 

it  can  be  concluded  that 

x(/)  = (>’(0y'(0 /"-'*(/))  (3-55) 

Tlte  above  relationship  is  valid  only  if  /t*”""(t)  does  not  contain  delta  functions 
at  r = 0.  Further,  this  relationship  applies  only  to  the  system  characterized  by  a 
differential  equation  of  the  form 

+ ...  + KqT'  = -F  ...  + m(,M 

where  8 and  m are  constant  coefficients. 

Returning  to  (3.54)  we  examine  the  response  when  the  input  is  m = 6(r). 
Setting  x(Zo)  = 0 and  Zq  = 0 we  have 

h(0  = x(Z)  t > 0 

hit)  = 0 t <0  (3.56) 

Equations  (3.56)  are  interpreted  to  mean  that  h(r)  is  the  system  state  at  time  t 
given  that  the  system  was  initially  in  its  zero  state  at  time  fn  when  excited  by 
the  impulse  6(r).  Tlius  h is  referred  to  as  the  state  impulse  response. 
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State  etiuatioii  (.^54),  wiiich  expresses  the  system  state  at  time  t in  terms 
of  the  initial  state  at  time  to  and  the  applied  input,  can  be  readily  reformatted 
into  a differential  form.  Returning  to  (3.5! ) and  setting  r = /,  we  have,  remem- 
bering that  <I>(0)  = I if '(>(0)  is  nonsingulat. 


x(r)  = *i>(0)x(r)  + h(0)u(r)  (3,57) 

which  is  the  state  equation  in  differential  form.  4>(0)  relates  to  the  state  transi- 
tion matrix  <J>(r),  where  specifically  <I>(r)  is  the  solution  of  the  differential 
equation 


4>(f)  = <i>(0)<I>(r) 


(3.58) 


Thus,  for  a system  characterized  by  input-output-state  relation 


y(r)  = <<Hr  - to)lx(f|))>  + 


■''0 


/;(r- 


t > to 


the  state  equatioits  in  differential  form  are,  for  all  t, 

x(t)  = <i>(0)x(r)  + h(0)M(r)  (3.59) 

y(r)  = <0(O)|x(/)>  (3.60) 

Accordingly,  the  state  at  time  t is,  by  (3.54). 


x(r)  = 4>(t-  to)’‘(/o)  + 


f h(r- 

to 


4 

Canonical  Formulation 


4.1  INTRODUCTION 

The  canonical  forms  (2.26)  and  (2.27)  of  the  state  equations  provide  for  the 
central  idea  of  associating  a state  vector  with  a linear  differential  system.  The 
resulting  products  of  the  method  described  below  are  expressions  for  the  state 
vector  along  with  expressions  for  the  matrices  A,  B,  C and  D of  (2.26)  and 
(2.27). 

In  general  a linear  differential  system  can  be  characterized  by  an  input- 
output  relationship  of  the  form 

l.{D)y  = M{D)u  (4  1) 

where  the  operators  I.{D)  and  M(D)  are 

L(D)  = i„iy'  + + •••  + «() 

M(D)  = 
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D‘  = J'lJi*  ((=1,2 k n) 

D'  =f()dt 


The  coefficients  C and  m are  constants,  and  not  necessarily  real.'  liquation  (4.1 ) 
can  also  be  formulated  as 


for  differential  operator  systems,  or  as 


1.(0)}’  = u 


(4.2) 


(4.3) 


for  reciprocal  differential  operator  systems.  Specifically,  our  procedure  of 
attaching  a state  vector  to  differential  systems  will  be  to  solve  expressions  (4.1 ), 
(4.2)  and  (4.3),  and  relate  the  resulting  coefficients  t and  m to  the  components 
of  the  state  vector. 


4.2  RECIPROCAL  DIFFERENTIAL  SYSTEM 

For  reciprocal  differential  systems  we  seek  the  solution  to  (4.3); 

L(D)y  = u 

where 

I.(D)  = $!„£»«  + + ...  + Co 

To  solve  (4.3)  we  will  make  direct  use  of  the  Laplace  transform.  Tlie  transform 
of  the  nth  derivative  of  a time  function  >'(r)  can  be  written  as 


(/«>■{/)' 


dr" 


= s"K(s)  - s"-ly(0)  - s"-2Z)v(0)  - ...  - O"-^y(0) 


= s>’y(s)  - s"-'y(0)  - s''-2v(l)(0)  - ...  _ v4"-"(0) 

I.  In  general  the  index  used  for  both  coefficients  c and  m,  i.c..  and  are  not  the 
same.  It  is  recognized  that  the  order  of  the  differential  operators  associated  with  u and  y 
will  be  different  from  one  another:  however,  there  is  no  loss  in  generality  in  using  a single 
subscript  m.  This  will  be  done  in  the  sequel  for  convenience  only. 
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where  s is  the  complex  frequency,  is  the  rth  derivative  of  .t’(O),  and  Y(s)  is 
the  transform  of y(/).  The  transform  of  both  sides  of  (4.3)  gives 

I + ,..  + f!o)ns)  = U(s)  + 

+ (B„s"+6„_|4-"-'  + ,..  + 8„)/«-2)(0) 

+ ...  + (C„s«  + 8„_|S«-‘  + ...  + eo) 

(4.4) 


On  letting 


l.{s)  = «„s«  + + ...  + Ko 

equation  (4.4)  reduces  to 


n.) . f:’  » £ 

L(s)  ,=  I 


uT)  • 


,(r- 


'>(0) 


(4.5) 


The  inverse  Laplace  transform  of  both  sides  of  (4.5)  gives  the  time  dependent 
output  as 


" c' 

y(t)  = L v<'-»(0)v;,.(r)  + I hit  Oua)dk  (4.6) 

<•=1  X 


where 


hit)  = T{//(s)} 
ffis)  = \HAs) 


and 


^i(t)  = £ 


-1 


lAs) 


The  time  functions  y),.  satisfy  the  differential  equation 

UD)^i  = 0 


(4.7) 
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whereas  h satisfies 


l.(D))i  = 0 

Clearly  the  yj,-  are  linearly  independent  and  qualify  as  a set  of  basis  func- 
tions for  the  system  characterized  by  (4.3).  It  follows  that  (starting  at  rp  = 0) 
the  input-output  pair  (u.y)  satisfying  (4.3)  also  satisfies  (4.6),  since-(4.6)  is  an 
extension  of  (4.3).  A closer  examination  of  this  relationship  reveals  that  for 
every  input-output  pair  satisfying  (4.3)  there  corresponds  an  n-tuple  of  complex 
numbers  (X].X2 X„)such  that  (u,y) satisfies  (4.6)  for  / >0  with 

y('  ’>(0)  = X,  / = l,2,...,«  (4.8) 

(Conversely,  for  every  /(-tuple  (X|,...,X„)  the  expressions  for  v given  by  (4.6), 

where  each  is  replaced  by  X/  (/ = 1,2 n).  defines  an  input-output 

pair  for  (4.3);  i.e.,  (4.6)  is  a solution  of  (4.3)  for  each  Xi,X2 X„.)  By  virtue  of 

the  constancy  of  the  coefficients  of  (4.3)  and  the  time  translation  properties  of 
linear  time-invariant  systems  tire  time-shifted  pair  satisfying  (4.3)  also 

satisfies  (4.6).  Therefore,  the  general  solution  to  (4.3)  for  arbitrary  Iq  can  be 
written  as 


,v(0 


n 

Zv 

1=1 


(/- 


»(fo)^,-(/-fo)  + r (4.9) 

'^>0 


where  t^. 

The  state  vector  x(C)  must  now  be  fixed  to  the  system.  This  can  be  done 
for  linear  time-invariant  systems  by  relating  the  constants  of  (4.3)  with  the 
components  of  x(/o)-  A straightforward  way  to  do  this,  though  not  necessarily 
the  most  advantageous  method  for  all  cases,  is  to  relate  the  basis  functions  with 
the  respective  ifi,  of  (4.9).  From  (3.S5)  the  components  of  x(/o),  which  are  the 
coefficients  of  the  basis  functions,  are 

-^l(fii)  = .v(fo) 


X2O0)  = v^'Htn) 


Xn(fn)  = .v'''-”(to) 


We  readily  identify  x(/o)  as 

x(fii)  = (.v(fn)-.v“*(to) /"-"(to))  (4.10) 


.4, 
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Digressing  temporarily  an  alternative  method  of  establishing  the  compo- 
nents of  the  state  vector  x(ro)  is  seen  from  expression  (4.5>.  Tire  Laplace  trans- 
form of  the  respective  basis  functions  gives 

s>'-'  ^ j_ 

L(s)  ’ l.(s)  ’ l.(s)'  L(s) 

Thus,  the  components  of  x(ro)  are 

X2O0)  = + ‘i«-iv(fo) 

Jf3(^n)  = + «n-2y(to) 

XnOn)  = + eiT(fo) 

Returning  to  (4.10)  the  terms  y^‘  D (/  = \,2,...,n)  in  (4.9)  are  each  re- 
placed by  their  corresponding  expressions  from  (4.10).  Accordingly,  the  solution 
to  (4.3)  for  arbitrary  /q  becomes 

>-(/„)  = <0(/-ro)lx(/o)>  + ( hit  - ^)u(^)d%  (4.11) 

which  is  in  agreement  with  (3.39).  By  (4.1 1)  the  state  vector  x(ro)  has  been 
fixed  to  the  system. 

To  complete  the  analysis  of  the  reciprocal  differential  operator  system  it 
now  remains  to  show  that  x(r)  qualifies  as  a state  vector.  We  form  x(f)  and  y(r). 
If  the  resultant  expressions  are  of  the  canonical  form  (2.26)  and  (2.27), 

x(f)  = Ax  + Bu 

y(t)  = Cx  + Du 

then  x(r)  qualifies  as  a state  vector.  Considering  only  fixed  systems  (systems 
where  A.  B,  C and  D are  constant  matrices)  it  follows  from  (4.10)  that  at  time  t 


and 


x(0  = (/(0,/*H0.--  ■>’^”"’H/)) 
x(/)  = (vC>(/),j(2)(,) _y(«)(,)) 


(4.12) 


(4.13) 
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From  (4.3)  the  general  expression  involving  the  «th  derivative  of  v(/)  is 

v‘">(0  = to.v(O) 

Substituting  the  above  expression  into  (4.1 3)  gives 

x(0  = 

(4.14) 

Thus,  the  components  of  x(f)  are  a linear  combination  of  x(r)  and  u(r).  Hence, 
from  equations  (4.12),  (4.13)  and  (4.14)  the  components  of  x(r)  are 

^l(0  = X2it) 

X2i‘)  = Xiit) 

8o  1 

<u)  = -rxi(t)  - ...  - -^x„(t)  + ~u{t) 

y(r)  = x,(f)  (4.15) 


Equations  (4.15)  expressed  in  matrix  form  become 
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or 

x(r)  = Ax  + 

Bu 

y(r)  = Cx  + 

Du 

= <C|x> 

(4.18) 

where 


C = (1  ...  0]  D = 0 (4,19) 

Thus,  x(r)  as  defined  by  (4.12)  qualifies  as  the  state  vector  for  the  reciprocal 
differential  operator  system  characterized  by  equation  (4.3). 

4.3  DIFFERENTIAL  OPERATOR  SYSTEM 


The  differential  operator  system  characterized  by  (4.2)  is  of  interest  for  two 
reasons;  It  is  possible  to  realize  any  differential  system  as  a combination  of  both 
the  reciprocal  differential  and  the  differential  operator  types.  Secondly,  each  is 
the  inverse  of  the  other.  For  the  system  of  (4.2)  we  seek  the  general  solution  to 

V = M(D)u 


where 

M(D)  = ntnO"  + + 

The  Laplace  transform  of  both  sides  of  (4.2)  gives 


+ mo 


where 


L{m„s>’-'  + .. 
/ = ! 

, + n,)M<'-l>(0) 

(4.20) 

+ + . 

,,  + mo 

(4.21) 

• 


CANONICAL  FORMULATION 


53 


and  U(s)  = £{u(t)}  (4.22) 

The  inverse  transforni  of  (4.20)  gives  the  time-dependent  output,  for  fo 

v(0  = H ^,(t)«(‘-'>(0)  + h{t- (4.23) 
1 = 1 


Pi(t)  = X-Km„s"  +m„_is"  ' + ...  + «,} 


h{t)  = X-‘{M(s)}  (4.25) 

From  (4.23)  the  n basis  functions  pj  are  linearly  independent.  They  represent 
the  zero-input  responses  to  the  differential  operator  system.  Similarly,  h(t)  is  the 
system  impulse  response.  Relating  the  basis  functions  with  the  respective  ip,-  we 
let,  from  (4.23), 

xi(0  = u(t) 

X2(t)  = uW(t) 


x„(t)  = «("-l>(f) 
Thus,  the  state  vector  for  the  system  of  (4.2)  is 
x(z)  = (u(r),t/(’\0> 
Forming  the  expression  for  x(f) 

x(f)  = (m(1)(/),«^^H0-  - 
and  comparing  it  with  that  for  x(t)  we  see  that 
xi(f)  = u 
X2O)  = = xi 

xj(t)  = = X2 

x„(t)  = = x„_i 


**  r ' • 
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Hence,  the  input  output  relation  (4.2)  can  be  written  as 
y = mi)U  + + ...  + 

= mo>^l  + ^1,^2  + 

= mo->fi  + '”1^1  + ••■  + m„x„  (4.29) 

Equations  (4.28)  and  (4.29)  comprise  the  state  equations  for  the  reciprocal 
differentia]  operator  system.  Equation  (4.28)  is  the  corresponding  state  vector. 

In  the  representation  outlined  above  the  state  equations  representing  the 
differential  operator  system  are  not  of  the  canonical  form 

X = Ax  + Bu 
y = Cx  + Du 

The  corresponding  input-output  relation  for  arbitrary  rp  obtained  by  sub- 
stituting for  each  in  (4.23)  the  corresponding  Jc,(0)  as  determined  by 

(4.28).  The  result  is 


y{t)  = to)XiUo)  + + ^n(t  - to)x„(to)  + f h(t-  ?)n(|)t/? 

■’to 


= m-  fo)lx(fo))  + f h(t-  |)u(?)d| 
■’to 


(4.30) 


4.4  GENERAL  SOLUTION 

We  seek  the  solution  to  the  generalized  equation  (4.1): 

L(D)y  = M(D)u 


where' 


L(D)  = t„D>t  + + ...  + «o 


M(D)  = m„Df  + in-iDf-i  + ...  + Co 


1 . In  general  the  order  of  the  coefficients  for  C is  not  the  same  as  those  for  m.  The  common 
order  n is  being  used  for  convenience  only. 


•If”  vw 
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The  Laplace  transform  of  both  sides  of  (4.1)  gives 
A(i)T(i)  = M(s)U(s)  + J(s) 

K(s)  = M(s)U(s)  + L~\s)J(s)  (4.31) 

The  ratio  H(s)  is  defined  as 

H(s)  = M(s)IL(s) 

and  the  polynomials  L{s)  and  M{s)  were  previously  given  as 
Us)  = + ...  + 8o 

Af(s)  = + ...  + mp 

J(s)  is  the  residual  polynomial  resulting  from  the  Laplace  transform  of  u(t)  and 
y(t)  at  time  / = 0.  When  arranged  in  powers  of  s the  polynomial  J{s)  is 

J(s)  = s""'[e„.v(0)  - m„«(0)] 

+ s"-2[8„£^(0)  - m„Du{0)  + e„-i.v(0)  - m„_iu(0)] 

+ [«„Z)''-lv(0)  - m„£)"-'M(0)  + ...  + 8i>'(0)  - wiu(O)] 

(4.32) 

The  state  vector  x(/)  can  be  established  by  relating  the  components  of 
x(/q)  to  the  components  of  J(s)  for  /q  = 0.  Equating  JC](0)  to  the  coefficients  of 
^2(0)  to  the  coefficients  of  etc.,  we  have 

jfl  = - m„u 

■*2  = - ^nDu  + i„-iy  - m„^iu 

x„  = i„D’'~^y  - m„D''~^u  + ...  + 8i_v  - ni\u  (4.33) 
Equations  (4.33)  can  also  be  written  as 

xi  = S„y  - m„u 

X2  = xi  + e„_iJ  - 

Xn  = x„-i  + «i/  - miu  (4.34) 
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and  from  (4.1 ) it  is  clear  that 

Xn  = - wo“)  (4.35) 

Solving  equations  (4.34)  and  (4.35)  for  the  various  Xj  and  y in  terms  of  the 
corresponding  JC/  and  u gives 


Xn-\  = ^[-^1^1  + ~ 


1 


Xn  = %mnM 


(4.36) 


and 


= TtI-*’!  + '”n"l 

'^n 


(4.37) 


To  qualify  x(r)  as  the  state  vector  it  is  necessary  to  demonstrate  that  x,  x,>’  and 
u satisfy  the  state  equations  (2.26)  and  (2.27).  By  inspection  (4.36)  and  (4.37) 
are  of  the  form 

X = Ax  + Bu 
y = Cx  + Du 

where 


A = 


e 

1 

L.  — 

0 

..  0 ■ 

-e«-2 

0 

8«  ■ 

0 

-ei 

0 

0 

..  c„ 

,-«0 

0 

0 

0 

® = r 


6n'"n-l  - 
^n^^n-2  ~ 


?„wio  - %>n„ 


• 
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1 

■ 

'm„' 

c = 

F ° ■ 

, Vi 

..  0 

D = 

(4.38) 


Thus,  x(7)  as  defined  by  (4.33)  qualifies  as  the  state  vector  representing  a system 
characterized  by  differential  equation  (4.1).  The  system  matrices  A and  B are 
constant  n X n and  n X 1 matrices,  respectively.  Operators  C and  D are  the 
coefficients  of  T(Z3)  and  M(D).  respectively. 

It  was  previously  seen  from  (3.39)  that  the  system  output  is 

y(0  = <0(7-  /o)lx('o)>  + f h(t- ?)u(|)t/? 

''lo 

The  scalar  product  above  identifies  the  zero-input  response  as 
<0|x>  = ip,.X,  + ^2^2  'Pn^n 

Using  representation  (4.33)  and  (4.34)  equation  (4.31 ) can  be  written  as 

n 

Y(s)  = //(s)f/(s)  -t  (4-39) 


The  components  x,-  of  the  state  vector  are  identified  as  the  coefficients  of  the 
respective  s"”'  in  (4.33).  Applying  the  inverse  Laplace  transform  to  both  sides  of 
(4.39)  and  comparing  the  results  with  (3.39)  the  zero-input  responses  are 


Pi 


Us) 


i = 1, 2,  ...,n 


(4.40) 


The  responses  Pn  ^re  linearly  independent  and  constitute  a set  of  basis 

functions  for  the  system. 


4.5  SYSTEM  REALIZATION  AND  EQUIVALENCE 

Given  a linear  system  characterized  by  a single  differential  equation  of  the  form 
(4.1)  one  can  always  construct  (realize)  an  equivalent  system  comprised  of  an 
interconnection  of  adders,  scalors,  integrators  and  differentiators.  In  establishing 
the  meaning  of  the  equivalent  system  it  is  assumed  that  the  reader  is  familiar 
with  the  analytic  definitions  of  adders  and  scalors.  Thus,  further  elaboration  on 
these  two  terms  is  not  warranted.  For  our  purposes  we  will  proceed  to  define 
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integrators  and  differentiators.  An  integrator  is  a system  whose  input-output 
relation  admits  to  the  form 


(D*a)y  = u (4.41) 

where  the  constant  a is  not  necessarily  real.  The  integrator  is  denoted  by 
1/(Z)  -Fa).  The  state  of  an  integrator  at  time  t is  proportional  to  its  output: 

x(t)  = kyU) 

where  k is  the  constant  of  proportionality. 

A differentiator  is  a system  whose  input-output  relation  admits  to  the 

form 


y = (D  + b)u  (4.42) 

where  b is  also  a constant  not  necessarily  real.  The  differentiator  is  denoted  by 
(Z)  -F  b).  The  state  of  a differentiator  at  time  / is  proportional  to  its  input: 

x(t)  = k'uit) 

A vector  x whose  components  are  the  outputs  of  the  integrators  and  the 
inputs  to  the  differentiators  qualities  as  a state  vector  of  the  equivalent  system. 
(It  is  immediately  obvious  that  by  letting  the  output  of  an  integrator  also  serve 
as  the  input  of  a differentiator  the  dimensionality  of  x for  the  equivalent  system 
is  reduced.  Although  a dimensional  reduction  of  x is  highly  desirable  from  a 
computational  point  of  view  an  arbitrary  reduction  or  assignment  of  state  com- 
ponents may  prove  to  be  futile  in  providing  for  a solution  of  the  input-output- 
state  or  canonical  equations.)  As  a general  rule  the  procedure  for  associating  a 
state  vector  with  a system  comprised  of  adders,  scalors,  integrators  and  differen- 
tiators involves  first  assigning  a component  of  x to  the  output  of  each  integrator 
followed  by  assigning  a component  of  x to  the  input  of  each  differentiator  not 
connected  to  the  output  of  an  integrator  through  a scalar.  This  rule  is  ex- 
emplified diagrarnatically  in  Fig.  4.1,  which  shows  integrators  7i  and  I2  inter- 
connected with  differentiators  D\  and  D2.  Tlie  adders  are  denoted  by  the  + signs 
and  the  scalors  by  the  constants  A.i  and  ki-  State  vector  components  X\  and  xj 
are  assigned  to  the  outputs  of  integrators  /]  and  /2,  respectively.  The  remaining 
component.  ;C3,  is  assigned  to  the  input  of  differentiator  D\.  Note  that  the  input 
to  differentiator  D2  is  directly  connected  to  the  output  of  integrator  I\.  Tlius  no 
component  is  assigned  Di-  The  state  vector  x consists  of  components  xj,  xi  and 
^3- 

Another  observation,  not  quite  as  apparent  as  the  one  cited  above,  involves 
virtual  removal  of  the  integrators  and  differentiators,  thus  reducing  the  system 
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Figure  4.1  Associating  state  vector  with  system  of 
adders,  scalors.  integrators  and  differentiators 


to  a memoryless  interconnection  of  adders  and  scalors.  Letting  the  output  of  the 
/th  integrator  be  Xj  we  see  from  (4.41)  that  tlie  input  to  the  rth  integrator  is 
equal  to  {£)  + a, Ox,-.  Similarly,  letting  Xj  be  the  input  to  the  /th  differentiator  by 
(4.42)  the  output  of  the  /th  differentiator  is  equal  to  {D^hj)Xj.  It  can  be 
reasoned  that  for  an  interconnection  of  adders,  scalors,  integrators  and  dilferen- 
tiators  the  input  and  output  of  the  system  will  remain  unaltered  if  the  /th 
integrator  is  removed  and  replaced  by  an  input  x,-  applied  to  the  terminal  to 
which  the  output  ot  the  /th  integrator  is  connected.  The  term  (/)+a,)x,  plays 
the  role  of  a suppressed  output  at  the  input  terminal  of  the  /th  integrator.  In  like 
fashion  the  system  output  and  input  remains  unaltered  on  removing  the  /th 
differentiator  and  replacing  it  by  an  input  (D+  b/)x/  applied  to  the  terminal  to 
which  the  output  of  the  /th  differentiator  is  connected.  Xj  plays  the  role  of  a 
suppressed  output  at  the  input  terminal  of  the  /th  differentiator.  On  applying 
the  above  reasoning  to  the  interconnection  ol  Fig.  4.1  we  see  the  system  can  be 
reduced  to  a series  of  segregated  memoryless  interconnections  of  adders  and 
scalors,  as  exemplified  in  Fig.  4.2.  From  Fig.  4.2  we  obtain  by  inspection  the 
simultaneous  equations 

kiiD  + hi  >X|  = XJ  - tt 
(/2  + hl)x3  - (/)-ui)X|  = -k\X2 
{D*ai)xi  = xi 

y = •*! 


Figure  4.2  Memoryless  system. 
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Upon  rearrangement  the  above  equations  become 


+ 


I 


I 


TC2  = - “2X2 

X2  = (a\-b2)x\  - k\X2  + 

y = xi 


(4.43) 


which  can  be  written  in  canonical  form  as 

X = Ax  + Bu 
y = Cx  + Du 

where 


-b2  0 

1 

ki 

■ J_" 
ki 

A = 

1 -02 

0 

B = 

0 

ia\-b2)  -b\ 

J_ 

*■'2- 

C = II  0 0| 


Equations  (4.43)  are  the  desired  state  equations  for  the  equivalent  system  of  Fig. 
4.1.  Tlius,  the  relationship  between  the  system  described  by  the  differential 
equation 


L(D)y  = M(D)u 

and  an  equivalent  system  described  by  the  canonical  equations 

X = Ax  + Bu 
y = Cx  + Du 


CANONICAL  FORMULATION 


61 


comes  into  sharper  focus.  Given  any  proper  time-invariant  system  of  finite  order 
cliaracteri/ed  by  the  above  canonical  equations,  one  can  always  structure  an 
equivalent  system  characteri/.ed  by  a single  (or  set  of)  differential  equation(s)  of 
the  form  (4.1 ). 

The  above  technique  is  but  one  of  many  means  to  realize  a system  de- 
scribed by  an  equation  of  form  (4.1).  In  summary  the  technique  is  as  follows: 
Given  a system  characterized  by  the  relation 

l.lD)y  = m^)u 

an  equivalent  system  can  be  constructed  of  adders,  scalors.  differentiators  and 
integrators.  Tlie  equivalent  system  is  a realization  of  the  given  system.  A state 
vector  X can  be  associated  with  the  equivalent  system.  Since  the  two  systems  are 
equivalent  it  follows  that  x is  also  a state  vector  for  the  given  system.  Similarly, 
the  state  equations  for  the  equivalent  system  may  be  regarded  as  the  state 
equations  of  the  given  system. 


4.6  METHOD  OF  PARTIAL  FRACTIONS 

As  a special  case  of  the  realization  technique  we  examine  the  methods  of  expan- 
sion by  partial-fractions.  Our  interest  in  this  method  is  in  the  fact  that  it  yields 
state  equations  in  which  the  A matrix  is  in  diagonal  form,  thus  the  eigenvalues 
(or  characteristic  roots)  associated  with  the  system  are  readily  identified.  This, 
of  course,  requires  factorization  of  t.(D).  which  in  certain  cases  may  prove  to  be 
a disadvantage  of  the  technique.  It  is  desired  that  /,(/>)  be  factored  and  put  into 
the  form 

lAD)  = (D- X|  )“'//>- X2)“^.  (0- \,)“‘'  (4.44) 

where 

HD)  = + ...  + «o 

0|  +012  ■ ■ ■ + 01^  - » 


The  X’s  are  the  distinct  roots  or  zeros  of  t.(D)  and  the  a's  are  their  respective 
multiplicities. 

To  illustrate  the  partial  fraction  techniques  we  will  analyze  two  examples: 
the  case  of  simple  zeros  and  the  case  of  multiple  zeros.  For  the  case  of  simple 
zeros  consider  the  system  of  integrators  shown  in  Fig.  4.3.  We  have  the  condi- 
tions tt]  = «2  “ C],  C2. ...  are  constants. 
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Figure  4.3  System  of  simple  zeros. 


Using  (4.44)  the  form  of  the  input-output  relation  is 


(D- X,)(D-  \2)(D- Xj). ..(£»-  X„)>'  = M{D)u  (4.45) 


Let  the  system  of  simple  zeros  be  proper,  i.e,,  the  degree  of  L{D)  is  greater  than 
the  degree  of  M(D).  and  (D-  X,),  where  / = l,2,...n,  is  not  a factor  oi M{D). 
Clearly,  by  (4.45)  the  transfer  function  for  the  system  having  simple  zeros  is  of 
the  form 


m = 


Ms) 

(s-  Xi)(s-  X2)...(s-  X„) 


(4.46) 


On  expanding  H(s)  in  terms  of  partial  fractions  we  have 


H{s)  = 


s - X] 


+ 


f2 

S - X2 


+ ...  + 


Cn 

s-  X„ 


+ dQ 


(4.47) 


where  the  constants  c,-  are  dependent  on  the  X,-  and  the  coefficients  of  M(D).  The 
term  t/g  is  equal  to  zero  if  the  degree  of  M(D)  does  not  exceed  n - 1 . 

Digressing  temporarily,  we  verify  that  (4.47)  is  the  transfer  function  for 
the  system  in  Fig.  4.3.  The  Laplace  transform  of  the  state  equations 

X = Ax  + Bu 


y = Cx  + Du 

where  A.  B,  C and  D are  constant  matrices,  gives 


slX(s)  - x(0)  = AX(s)  + BU(i) 


(4.48) 
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Y(s)  = CX(s)  + DU(s)  (4.49) 

Solving  the  dynamical  equation  (4.48)  lor  X(s)  results  in 

X(s)  = (si  - A)"‘x(0)  + (si  - A)"*BU(s) 

which,  when  substituted  in  (4.49),  yields  for  the  output 

Y(s)  = C(sl-  A)"*x(0)  + lC(sl-  A)“'b  + D1U(s)  (4.50) 

The  output  in  (4.50)  identifies  the  system  zero-state  response  and  zero-input 
response  in  the  frequency  domain.  We  designate  each  respectively  as  Z(0;U(s)) 
and  Z(X(s);0).  Letting  the  state  x(0)  be  tlie  xero  state  the  system  response  is, 
from  (4.50), 

Z(0;U(s))  = [C(sl- A)-’b  + D]U(s)  (4.51) 

For  zero  input  the  system  response  is 

Z(X(s);0)  = C(sl- A)-'x(O)  (4.52) 

As  was  specified  earlier  the  matrix  D is  zero  for  a proper  differential  system. 
Letting  the  input  to  the  system  be  the  impulse  6(r),  and  since  T{5(r)}  = 1 , the 
transfer  function  is  identified  from  (4.51)  as 

H(s)  = C(sl- A)"'b  (4.53) 

In  comparing  (4.53)  with  (4.52)  it  is  seen  that  for  the  simple  case  chosen -a 
proper  system  of  simple  zeros-the  transfer  function  is  related  to  the  zero  input 
response  as 

H(s)  = Z(X(s);B) 

i.e.,  the  transfer  function  H(s)  is  the  Laplace  transform  of  tlie  zero  input  re- 
sponse starting  in  state  x(0)  = B.  Assuming  the  matrix  (si  - A)  is  nonsingular  the 
matrix  identity  for  the  inverse  operator  can  be  written  as 

, adi(sl-A) 

(si  - A)-’  = — - 

Isl-Al 

where  |sl-A|  and  adj(sl  - A)  are  the  determinant  and  adjoint  of  (si  - A), 
respectively.  From  (4.53)  the  transfer  function  becomes 


»«*•  • -.  — ■T«'  ' VX 
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H(s)  = 


C[adj(sl-A)]B 

lil-AI 


(4.54) 


By  inspection  the  state  equations  of  the  system  of  simple  zeros  shown  in  Fig.  4.3 
are 


JCl  = + u 


Xi  = X2Jf2  + u 


y = CiATi  + C2^2  + (4.55) 

from  which  the  system  matrices  are 


0 ..." 

■ 1 ■ 

A = 

0 

X2  ... 

B = 

1 

_0 

...  x„_ 

_ 1 _ 

C = [ciC2...c„|  (4.56) 


Substituting  the  expressions  for  matrices  A,  B and  C in  (4.54)  yields 


C\  Cy 

H(s)  = — + 


s-  X 


s - X2 


+ ...  + 


s - X„ 


thus,  verifying  (4.47). 

In  accordance  with  the  above  analysis  we  assert  that  for  a proper  system  of 
simple  zeros  characterized  by  an  input-output  relation  of  the  form  (4.45) 

(D-  XiXO-  X2). ..(£>-  X„)  = M(D)u 

where  the  various  X,-  are  distinct  and  no  term  (Z>  - X,)  is  a factor  of  M(D),  the 
transfer  function  f/(s)  can  be  represented  in  partial  fraction  form  (4.47): 


Cl 

ms)  = -A-  + 

s - Xj 

The  system  as  shown  in  Fig.  4.3  is 
equation  (4.45).  The  vector 


s - X2  s- 

a realization  of  the  system  represented  by 
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defined  by  (4.55)  as 


x(0  = {xi,xi,...x„) 


X = X/X/  + M /■  = 1,2 n 


qualifies  as  the  system  state  vector.  Equations  (4.56)  are  the  corresponding  state 
equations  for  the  system. 

In  extending  the  above  discussion  to  the  case  of  multiple  zeros  we  first 
consider,  for  simplicity,  the  case  where  /,(£))  has  one  multiple  zero  Xi  of  order 
a.  The  expression  for  /.(D)  becomes 

/.(D).v  = l(D-Xi)“(D-X„4.i)...(D-X„)|y 

= M(D)u  (4.57) 

The  corresponding  expression  for  H{s)  can  be  written  as 


H{s)  = 


(s-  \\fU  - Xq+i)...(s-  X„) 


Expanding  H{s)  into  partial  fractions  gives 

C]  C2 

//(s)  = — + 

(s-X,)“  (S'Xi)“-‘ 


(s-  X,) 


^a+1  . . , , 

+ + ...  + ■ + do 

(s-X„+i)  (s-X„) 


By  (4.59)  it  is  seen  that,  in  essence,  one  multiple  zero  of  order  a has  the 
equivalent  effect  of  adding  a integrators  to  one  of  the  integrator  circuits.  This  is 
shown  graphically’  in  Fig.  4.4.  (Obviously,  for  the  case  of  more  than  one 

multiple  zero,  say  m multiple  zeros  each  of  order  a.ji.y respectively,  the 

equivalent  effect  would  be  to  add  a integrators  to  the  respective  circuit  labeled 
a,  (3  integrators  to  the  respective  circuit  h,...|  integrators  to  the  respective 
circuit  labeled  m.)  Choosing  the  output  of  each  integrator  as  an  element  of  the 
state  vector  x - {x\.xi,....x„)  the  state  equations  are.  by  inspection. 


1 . from  Zedeh  and  Desocr.  Linear  System  Theory.  McGraw-Hill,  New  York  1963. 
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Figure  4.4  System  of  multiple  zeros. 


Xi  = XiXi  + X2 


Xa-l  = XlJfa-1  + Xa 
Xa  = “ 


■*ri  ^ 

y = ciX\  + C2AC2  + ...  + c„x„  + do  (4.60) 

wherein  it  is  clear  that  this  choice  of  x qualifies  as  a system  state  vector.  Hence, 
for  a proper  system  characterized  by  the  input-output  relation 

{D-  - ~>^)y  = mu 

where  the  various  X,-  are  distinct  and  no  term  {£>  - X,)  is  a factor  of  M{D),  the 
transfer  function  H(s)  can  be  represented  in  partial  fraction  form  (4.59).  The 
system  of  Fig.  4.4  is  a realization  of  (4.57).  The  corresponding  state  equations  in 
canonical  form  are,  from  (4.60), 

X = Ax  + Bu 

V = <Clx>  + dgu 
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where 


^1 

1 

0 ... 

0“ 

'o“ 

0 

^1 

1 ... 

0 

A = 

0 

0 

X,  1. 

0 

B = 

0 

0 

0 

0 ... 

K. 

_1_ 

C = [ci  C2  • C/il 


5 

Solutions  to  the 
Canonical  Equations 


i 


5.1  FIXED  CONTINUOUS-TIME  SYSTEMS:  TIME  DOMAIN  ANALYSIS  | 

The  differential  equations  governing  the  behavior  of  linear  continuous  time  j 

systems  were  established  in  (2.26)  and  (2.27)  as  ( 

1 

k{!)  = A(f)x(f)  -f  B(t)u(f)  I 

y(0  = C(?)x(r)  D(r)u(r)  | 

For  fixed  systems  the  matrices  A.  B.  C and  D are  fixed,  i.e.,  they  are  inde-  | 

pendent  of  time.  In  determining  the  time-dependent  solutions  to  (2.26)  and  ] 

(2.27)  for  fixed  systems  we  first  consider  the  homogeneous  form  of  (2.26)  | 

where  u{t)  = 0.  Equation  (2.26)  becomes  < 


--xiA  = AxIA  (S.n 


i 

i 
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where  the  matrix  A is  a constant  matrix.  Solving  (5.1 ) for  x(/)  between  the  time  j 

limits  /o  and  f we  have 

i 

I 

I 

(5.2) 


it  is  seen  that  the  state  transition  matrix  can  be  represented  as 
4>(r-  /o)  = 

= I + A(t-fo)  + + ...  (5.3) 

Tlius.  by  (5.3),  4>(t)  is  also  referred  to  as  the  fundamental  matrix. 

The  complete  time-dependent  solution  to  (2.26),  where  A and  B are  con- 
stant matrices,  is  readily  obtained  through  the  method  of  variation  of  the 
parameter.  Assume  the  solution  to  (2.26)  is 

x(t)  = (5.4) 

where  f(r)  is  to  be  determined.  The  time  derivative  of  solution  (5.4)  gives 

kit)  = At>'^‘'''o^f(/)  + e^*'"'o^f(r)  (5.5) 

Substituting  (5.4)  into  (2.26)  and  using  identity  (5.5)  gives 

X-  = Bu(r) 

= e'^^'"'o*f(/)  (5.6) 


x(f) 

x(^o) 

x(r)  = e''^'"'‘^^x(fo) 


Upon  comparing  (5.2)  with  (3.47).  wherein 

x(/)  = <!>(/-  /‘o)x(fo) 


r 


1 
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Multiplying  each  side  of  (5.6)  from  the  left  by  e and  integrating  be- 
tween the  limits  assuming  = 0,  we  have 


jT  = j am 


,-A(£-to) 


Bu(?)r/? 


which,  when  substituted  in  solution  (5,4),  gives 

j 


x(r)  = r e''^(f-'o)Bu(J)r/? 

oo 

_ pA(r-fo)  r 'e-A(?-ro)Bu(5)d?  + J*  (5.7) 
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Finally,  on  substituting  (5.1 1)  into  (2.27)  the  input-output-state  relation  lor  a 
eontinuous-tiine  system  is 

V(/)  = C 

Equations  (5,11)  and  (5.12)  are  the  system  state  equations.  It  is  clear  from  5 

(5.1 1)  that  the  system  state  at  time  t can  be  determined  if  the  system  state  at  i 

some  previous  time  /q  (to  < t)  is  known  and  if  the  input  u(r)  is  known.  The  i 

manner  in  which  the  initial  state  is  “transformed”  is  characterized  by  the  make- 
up of  <1>  and  how  the  input  is  applied,  i.e.,  by  the  matric  operators  A and  B.  The 

system  output  (5.12)  reflects  a dependency  on  all  four  operators.  i 


<l>(/)x(0)  + r <I>(r-?)Bu(|)r/? 
•^0 


+ Du 


(5.12) 


5.2  FIXED  CONTINUOUS-TIME  SYSTEMS:  FREQUENCY  DOMAIN 
ANALYSIS 

For  fixed  continuous-time  systems  the  differential  equations  to  be  solved  are 

X = Ax  -t-  Bu 
y = Cx  + Du 

where  A,  B,  C and  D are  constant  matrices.  From  previous  discussions  the 
method  of  Laplace  transforms  was  seen  to  be  a convenient  method  for  solving 
equations  of  this  form.  It  was  established  in  (4.48)  and  (4.49)  that  the  Laplace 
transform  of  both  sides  of  the  above  equations  gives,  respectively, 


sX(s)  - x(0)  = AX(s)  + BU(s) 

Y(s)  = CX(s)  -t  DU(s) 

From  (4,48) 

X(s)  = (si  - A)"'x(0)  + (si  - A)-'bU(s)  (5.13) 


dri(s) 

jC{x|(r)} 

X2(s) 

= 

£{x2(f)} 

X„(s)_ 

£{XnU)} 
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which,  when  substituted  in  (4.49),  resulted  in  the  input-output  equation 


Y(s)  = C(s|-  A)-'x(O)  + [C(sl-  A)'‘b  + DlU(i) 


I 


> < 


K|(s) 

.C{vi(F)} 

Y2(s) 

= 

X{.V2(/)} 

(5,15) 


Tlie  zero-state  response,  i,e,,  x(0)  = 0,  identifies  the  transfer  function 
matrix  H(s)  in  (4.50)  as 


H(.?)  = C(sl  - A)  'b  -f  D 

~//ll(s)  Unis)  ...  H\,M) 

L^AiiCs)  HmM_\ 


(5.16) 


(5.17) 


Writing  the  zero-state  response  as 

Y(s)  = H(s)U(s)  (5.18) 

the  ith  component  of  the  transform  vector  Y(s)  is 
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where,  lor  convenience,  to  =0.  Applying  the  theorem  for  the  Laplace  transform 
of  a convolution  the  Laplace  transform  of  the  above  equation  is 

X(.v)  = $(,v)x(0)  + $(s)BX(s)  (5.21) 


$(s)  = X{<J>(r)}  (5.22) 

<h(/)  has  previously  been  identified  as  the  fundamental  matrix  of  the  system. 
Comparing  (5.21)  with  (5.13)  the  l-aplacc  transform  of  the  fundamental  matrix 
is 

$(s)  = (si- A)“'  (5.23) 


c*'  = <l>(f)  = r‘{(sl-  A)"'}  (5.24) 

Thus,  the  fundamental  matrix  <K0  is  the  inverse  Laplace  transform  of  the 
matrix  (si  A)  Accordingly,  <t>(s)  is  referred  to  as  the  characteristic  fre- 
quency matrix  of  the  system. 

Using  the  matrix  identity  for  an  inverse  operator  equation  (5.23)  can  be 


$(s)  = (si  - A) 


_l  _ adj(sl  A) 


where  |sl  - A|  and  adj(sl  A)  are  the  determinant  and  adjoint  of  (si  - A),  re- 
spectively. Lor  a system  of  order  n.  A is  an  nXn  matrix.  Therefore  the  deter- 
minant |sl  - A1  is  a polynomial  in  s of  degree  n.  which  can  be  written  as 

|sl  Al  = s + uis"“'  -I-  U2s"~^  -I-  ...  -I-  a„_|S  a„ 

= (s-X,)“i(s-\2)“2...(s-X„)“« 


Polynomial  (5.26)  is  the  characteristic  polynomial  of  the  system.  Its  zeros 

X|.  X2 X„  (those  values  of  s for  which  (si  - A|  = 0)  arc  the  characteristic  roots 

or  eigenvalues  of  the  system.  Accordingly,  the  equation 


Isl  Al  = 0 


is  called  the  characteristic  equation  of  the  system. 
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For  the  simple  case  where  ri  is  2 or  3 expansion  of  adjfsl  - A)  in  (5.25)  by 
cofacTors  suffices  to  determine  $(s).  Let  (si  A)/y  denote  the  (n~  l)X  (n-  I) 
submatrix  of  (sl  - A)  formed  by  deleting  row  / and  column  /.  The  scalar 

‘•//  = (-l/^^KsI- A),y|  (5.28) 

is  the  cofactor  of  the  (/,/)  element  of  (sl  - A).  The  nXn  matrix  (sl  - A)' 
formed  by  all  the  r ,y  elements  is  the  cofactor  matrix  of  (sl  - A): 


f|| 

t’12  ■ 

cofactor  (sl  A)  = (sl  - A)'  = 

C2I 

‘■22  ■ 

■ ‘2n 

Oil 

‘«2  ■ 

^'tm 

The  adjoint  of  (sl  - A)  is  the  transpose  of  the  cofactor  matrix  of  (sl  - A): 


adj(sl  - A)  = (sl  - A)'  = 


C||  ‘’21  Oil 

‘‘12  C22  c„2 


^2n  ^rm 


(5.30) 


The  tilde  denotes  the  transpose. 

In  principle  the  above  procedure  provides  for  determining  $(s).  However, 
for  values  of  n greater  than  3 this  procedure  becomes  impractical  since,  essen- 
tially. n\n-  l)X  (n-  I)!  multiplications  are  involved.  A more  practical  tech- 
nique. which  also  lends  itself  to  machine  computation,  involves  extending  the 
method  of  expanding  a rational  function  to  matrices  whose  elements  are  rational 
functions.  We  let 


(sl  - A)"' 


B(s)  ^ ^ 
|sl  - A|  p(s) 


where 


B(s)  = BflS"-'  + B]s"-2  -I-  ...  -f  B„_2S  + B„.| 


(5.31) 


(5.32) 


and 


-I-  fl2*"  ^ 


p{s)  = s + a|S"  * 


+ ...  + a. 
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The  coefficients  B(),Bi B„_i  are  constant  nXn  matrices,  and  was  previ- 

ously defined  in  (5.26).  Given  an  nXn  matrix  A the  coefficients  a,-  of  the 
polynomial  p(s)  and  the  coefficients  B,-  of  B(s)  are  determined  by  the  following 
algorithms:  * 


a,  = tr(B,_iA) 

1 = 1,2, ...,n 

(5.33) 

B,  = B,_iA  + a,l 

1 = 1,2, ...,n 

(5.34) 

The  above  algorithm  leads  to  the  following  relations  between  the  coefficients  Oj 
and  B, : 

ai  = -tr(A) 

II 

O 

GO 

02  = tr(B,A) 

Bj  - B()A  + a\\ 

aa  = -ytr(B2A) 

Ba  = B]A  + 02^ 

“n-l  " " „ - 1 ‘f(B„-2A) 

Brt-1  “ ^n-\^ 

a«  = -^tr(B„_iA) 

0 = B„_iA  + a„l 

(5.35) 

Equations  (5.35)  are  a more  efficient  procedure  in  computing  (si  - A)~’ . Only  n 
matrix  multiplications  are  required  as  opposed  to  n\n-  l)(n-  1)1  using  the 
method  of  expansion  by  cofactors. 

To  complete  the  computation  of  <I>(r)  in  (5.24)  we  compute  the  inverse 
Laplace  transform  of  (sl-A)“*.  Equations  (5.35)  result  in  a matrix  for 
(si  - A)”’  whose  elements  are  rational  functions.  The  inverse  Laplace  transform 
of  a proper  rational  function  (considered  to  be  the  ratio  of  two  polynomials 
where  the  degree  of  the  numerator  is  greater  than  that  of  the  denominator)  is 
most  readily  obtained  by  expanding  the  rational  function  into  partial  fractions. 
As  an  example  let  F{s)  be  a proper  rational  function  in  s defined  as 

FU)  = N{s)IQ{s)  (5.36) 


1.  V.  N.  l addeeva,  “Computational  Methods  of  Linear  Algebra,”  Dover  Publications,  Inc., 
New  York,  1959. 
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If  the  degree  of  A'(s)  is  greater  than  that  of  Q(s).  and  if  /'(s)  has  poles  at 

i = X|,X2 X„  and  the  order  of  the  pole  at  i = X^  is  aj  then  (5.36)  can  be 

factored  as 

" I 

/•'(s)  = E Z!  nyT-Vv  <5.37) 

/ = ! 1 = 1 (S-Xy)' 

A hint  of  the  structure  of  (5.37)  was  first  seen  in  (4.58)  and  (4.59).  Hach 
nonrepeated  factors  - X(i  in  the  denominator  of  (5.37)  results  in  a term 


‘“(s-Xs) 

whereas  each  of  the  repeated  factors  (s  - \jfi  give  rise  to  otj  terms 

Va./ '■ + y„.  ,/ '■ + ...  + yi/— ^ — 

(S  - X/)“/  ' (S  - X/)“/->  " ($  - Xy) 

For  a simple  first-order  pole  at  s = Xj>  the  coefficients  y,y  may  be  evaluated  by 
the  formula 


where  £)(Xe)  is  the  derivative  of  Q(s)  evaluated  at  s = Xj.  For  a pole  of  order  ay  at 
s = Xy  we  have 


“ (a. 


I tT/-'  r A'(s)  1 

</  - /•)!  r/s“/-'  [q(s)Ks  - Xy)“/J  s=Ay 


where  /=  1,2 ay. 

To  obtain  the  inverse  transform  of  F(s)  consider  the  following:  Die 
Laplace  transform  of  a time  function  of  the  form  f(l)  = - 1 )!  is 


(«-l)!J  s 


I 


V 

r 


/ 

I 

I 

! 
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Secondly,  the  transform  of  the  product  of  two  functions,  wherein  one  of  the 
functions  is  an  exponential,  is 


£{Ar)e-^}  = /•'(*+ A) 


Combining  the  ideas  of  (5.41 ) and  (5.42)  we  can  write 


jC 


(«-!)! 


1 

(s  + Xf 


(5.42) 


(5.43) 


Applying  the  rationale  of  (5.43)  to  (5.37)  the  inverse  Laplace  transform  of 
(5.37)  can  be  written  as 

m = jC-'{F(s)} 


m = Yij 


1 


i=u=\  ' {s-Xjy 


" “7  w-l 

= _L_ 

/ = ! 1 = 1 (1-1)! 


(5.44) 


Expression  (5.44)  prescribes  a method  for  determining  the  inverse  Laplace  trans- 
form of  each  element  of  $(s)  = (si  - A)-^  thereby  determining  <P(t).  The  matrix 
form  of  <P(f)  can  be  established  by  expanding  (5.37).  We  obtain 


1 

(s-X,y 


(5.45) 


where  j = 1,2 n <n.  The  inverse  Laplace  transform  of  (5.45)  gives 

n'cn  “/ 

4>(r)  = e^'  = E e^i'E'fij  ’ , (5  46) 

i=\  i=l  Mt-1)! 

where  (5.44)  served  as  a guide  to  establ'ish  the  general  form  of  <!>(/). 

To  illustrate  the  use  of  equations  (5.37)  through  (5.44)  consider  the  in- 
verse Laplace  transform  of 


^ -2s‘*-  11s-  16 
s^(s  + 1 )^(s  + 2) 


(5.47) 


L ^ 
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For  the  term  the  multiplicity  is  2.  The  respective  coefficients  K,y  deter- 
mined by  (5.40)  are  Tii  ~ 2 and  T21  ” “2.  Similarly,  the  coefficients  Ty  associ- 
ated with  the  degenerate  eigenvalue  of  (s  + 1)2  are  T]2  = 0 and  T22  “ 3.  From 
(5.38)  or  (5.39)  the  term  (s  + 2)  has  the  coefficient  Tja  ='4.  Thus  (5.47)  can 
be  written  in  form  (5.37)  as 


F{s)  = 7 - ^ + 


3 

(S  -t-  1)2 


By  (5.44)  the  inverse  Laplace  transform  of  (5.48)  is 


/(/)  = 2 - 3/  + 3/e-'  - 4e-2'  (5.49) 


Clearly,  the  inversion  of  the  n X n matrix  $(s),  using  the  above  method,  requires 
)i2  elemental  inversions. 


5.3  FIXED  DISCRETE-TIME  SYSTEMS:  TIME  DOMAIN  ANALYSIS 

A linear  discrete-time  system  of  fixed  form  ■'  “overned  by  a system  of  difference 
equations  of  normal  form',  from  (2.26)  and  (2.27), 

x(fc+  1)  = Ax(fc)  + Bu(fe)  (5.50) 

y(A:)  = Cx(/t)  + Du(fe)  (5.51) 

where  the  index  it  = 1,2, ...,«>  prescribes  the  discrete-time  sequence  and  the 
matrices  A,  B,  C and  D are  constant.  The  solution  to  (5.50)  may  be  found 
directly  by  recursion.  Substituting  in  (5.50)  discrete  values  of  k we  have 

x(l)  = Ax(0)  -F  Bu(0) 
x(2)  = Ax(l)  + Bu(l) 

= a2x(0)  + ABu(O)  -F  Bu(l) 
x(3)  = Ax(2)  + Bu(2) 

= a2x(0)  + a2bu(0)  + ABu(l)  + Bu(2) 


it-l 

x(it)  = A*x(0)  + E A'Bu(<:  - S - 1)  (5.52) 

ii=0 

1.  See,  for  example.  Freeman,  It.,  Discrete-Time  Systems,  John  Wiley  and  Sons.  New  York 
(1965). 
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or 

*-l 

x(k)  = A*x(0)  + E A*-“-'Bu(K)  (5.53) 

c=o 

The  fundamental  matrix  for  a discrete-time  system  becomes,  from  (5.52)  or 
(5.53), 

<I>(A:)  = A*  (5.54) 

where 

A*:  = A1A2  ...  A* 

Ai  = At  = ...A*:  = A (5.55) 

In  terms  of  the  fundamental  matrix  <t>(*)  the  state  equation  (5.52)  or  (5.53) 
becomes,  respectively, 

*-l 

x(/t)  = 4>(A:)x(0)  + E 4>(e)Bu(/t-  8-1)  (5.56) 

c=o 


or 


k-\ 

x(k)  = <I>(A:)x(0)  + E <!>(*- 8- l)Bu(A)  (5.57) 

c=0 


Substituting  (5.56)  into  (5.51)  the  output  equation  becomes 

*-1 

y(/t)  = C<l>(/fc)x(0)  + E C<l>(8)Bu(/t-8- 1)  -r  Du(A)  (5.58) 
c=o 

''An  alternate  expression  for  the  output  is  readily  derived  using  the  state  vector 
form  (5.57).  Solutions  (5.56)  and  (5.58)  are  the  discrete-time  analogs  of  (5.1 1) 
and  (5.12),  respectively.  The  corresponding  formulation  of  the  discrete-time 
state  equations  is  given  in  Appendix  C. 


5.4  FIXED  CONTINUOUS-TIME  SYSTEMS:  DISCRETE  INPUTS 

In  many  continuous-time  systems  the  input  is  often  a sampled  signal,  thus  char- 
acterizing the  input  as  discrete.  There  are  two  kinds  of  sampled  signals  (and 
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associated  discreteness)  we  will  consider:  (I)  the  piece-wise  constant  signals 
where  the  sampling  function  is  a pulse  of  constant  amplitude  over  a time  interval 

u(r)  = u(ryt)  tk  < t < r*+|  (5.59) 

and  (2)  the  impulse-modulated  signal  represented  by  a modulated  delta-f  unction 
series 

uit.tk)  = (5.60) 

/f=ii 

Signals  having  the  forms  of  (5.59)  and  (5,60)  are  respectively  depicted  in  Figs. 
5.1a  and  5.1b.  In  generating  u(r*)  a zero  order  holding  circuit  of  sorts  is  implied. 
Tlie  various  tk  are  arbitrary  but  satisfy  the  condition  tk+\  > tk- 

From  previous  discussions  the  response  of  a continuous-time  system  to 
sampled  inputs  can  be  readily  determined.  In  fact,  it  can  be  determined  'vith 
considerably  more  ease  than  would  be  required  to  determine  the  response  for  all 
time.  The  continuous-time  system  is  governed  by  the  dynamic  equation 

X = Ax  + Bu 

wherein  the  solution  is 

x(r)  = 'F(r)x(ro)  + j ?)Bu(^)r/? 

'^'0 


'I  '2 6 '»•! 

Figure  5.1  Sampled  signal. 
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Let  the  input  be  the  sampled  signal  of  (5.59).  It  is  a signal  of  constant  amplitude 
over  a prescribed  time  interval.  Then 


x(f)  = <t>(r)x(/o)  + M>(f- (5.61) 

Now  let  the  input  consist  of  a series  of  constant  vectors  of  form  (5.59).  Fhen  at 
time  + T.  where  0 <t  < /*+i-  for  each  time  interval  (ftT^+l)-  we  have 

xUk  +T)  = <l>(r)x(r*)  + I +’■-  ?)Bu(r*)c/f  (5.62) 

If  the  “staircase”  input  changes  uniformly  at  intervals  tk  = kT  equation  (5.62) 
becomes 


x(kT+T) 


,kT+T 

<D(T)x(A:r)  + <l>(fcr+T- ?)Bu(/cm 

■^kT 


(5.63) 


Finally,  for  t = T 


~kT+T 

x{kT T)  = ^{T)x(kT)  * I ^(kT  + T - (5.64) 
'^kT 

Clearly  (5.64)  can  be  written  as  the  difference  equation 


x(k  + 1)  = Ax(i<:)  + Bu(A:) 


where 


A = <I>(r)  = e'^' 

T 

B = f <P(T-  ?)d? 


(5.65) 

(5.66) 

(5.67) 


Thus,  the  continuous-time  system  governed  by 
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For  the  staircase  input  u(t)  = 2*  equation  (5.64)  specifies  the  state  vector  as  the 
difference  equation 


x(kT+  7’)  = y 


■ 4e-T- 
-4(e-T - e-*’') 


e-T-e^Tl 
-e~T  + 4e~*^. 


which  is  of  the  form 

x(A:  + 1 ) = Ax(Ar)  + Bu(A:) 

Hence,  by  (5.56)  the  state  vector  for  the  continuous-time  system  having  a stair- 
case input  is 

k-\ 

x(fe)  = A*x(0)  + H A«B  2*-®-' 
e=0 


where  A and  B are  determined  by  equations  (5.66)  and  (5.67),  respectively. 

We  next  consider  the  continuous-time  system  wherein  the  input  is  the 
series  of  impulses  represented  by  (5.60).  Referring  once  again  to  state  equation 
of  a continuous-time  system  we  have 

x(f)  = 4>(/)x(fo)  + f 

At  time  tjt  + where  0 <T<f*+i  - t/t,  we  have  on  substituting  (5.60)  into  the 
above  state  equation 

x(r*+T)  = <i>(T)x(rt)  + <l>(T)Bu(ft)  (5.68) 

Equation  (5.68)  is  the  state  equation  of  a continuous-time  system  where  the 
input  is  a series  of  impulses  occurring  at  times  (^  = 0, 1,2,...°°).  If  the  im- 
pulses occur  at  uniformly  spaced  time  intervals  where  = T.  equation 

(5.68)  gives  the  state  equation  for  this  condition  as 

xOtr+T)  = 4>(T)x(kr)  + <I>(t)Bu(A:7’)  (5.69) 


•ik. 
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where  0<t<T.  Lastly,  for  t = T we  have  as  the  discrete-time  state  equation  for 
a continuous-time  system 

xlArr-FT)  = MT)x{kT)  + <t>(T)Bu(kT)  (5.70) 


5.5  FIXED  DISCRETE-TIME  SYSTEMS;  r DOMAIN  ANALYSIS 

Sampled  data  systems  can  be  considered  as  continuous-time  systems  operating 
on  discrete-time  functions.  The  transform  addressing  discrete-time  systems  is 
introduced  for  the  same  reason  that  makes  the  Laplace  transform  useful  in  the 
study  of  continuous-time  systems.  Procedures  similar  to  analyzing  continuous- 
time systems  in  the  frequency  domain  will  be  followed  to  analyz.e  discrete-time 
systems  in  the  z domain  (where  z is  a complex  variable). 

The  transform  of  a discrete-time  function  /(k)  is  a power  series  in  z"* . 
The  coefficients /(/:)  are  the  amplitudes  of  the  discrete-time  signal.  We  have 

S{/(^)}  = /(k)z-^  (5.71) 

where  ^-{/(A:)}  is  the  transform  of/(k).  For  discrete-time  systems  those  values 
of  y(k)  where  k<0  are  of  less  interest  than  where  ^>0.  The  discussion  to 
follow  will,  therefore,  center  around  situations  of  positive  values  of  k.  Accord- 
ingly, our  consideration  of  (5.71)  will  be  bound  by  tlie  values  A;  = 0, 1,2 

From  Appendix  D the  transform  of  tlie  first  forward  difference  equa- 
tion is 

S(Mk)}  = Z{f(k  + ))-  /(k)} 

= (z- l)2;{/(k)}-  z/(0)  (5.72) 

where 


S{/(k)}  = Hf(k)z->‘  = /(O)  -F  /(l)z-'  + /(2)z-2  -f  ... 
k=0 

Z{f(k  + l)}  = /(I)  + f(2)z-'  -F  /(3)z'2  + ...  = z(2.{/(A)}-f/(0)1  (5.73) 

The  characterization  of  a linear  discrete-time  system  of  fixed  form  was  speci- 
fied earlier  by  equations  (5.50)  and  (5.5 1 ): 
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x(A-  + 1)  = Ax(A)  + Bu(A) 
y(k)  = Cx(A)  + Du(A) 

Taking  the  S transform  of  both  sides  of  the  above  normal  equations  we  have, 
using  property  (5.73), 

zX(z)  - zx(0)  = AX(z)  + BU(z)  (5.74) 

Y(z)  = CX(z)  + DU(z)  (5.75) 

where 

U(z)  = S{u(A)} 

X(z)  = l'fx(A)} 

Y(z)  = S{v(A)} 

Solving  (5.74)  for  X(z)  we  have 

(zl  - A)X(z)  = zx(0)  + BU(z) 

X(z)  = (zl  - A)-'zx(0)  + (zl  - A)-'bU(z)  (5.76) 

which,  when  substituted  in  (5.75),  gives  the  discrete-time  output  as 

Y(z)  = C(zl- A)"‘zx(0)  + [C(zI-A)-‘b  + D|X(z)  (5.77) 

Referring  to  state  equations  (5.56)  and  (5.58)  the  J transform  of  the  output 
equation  (5.58)  gives 

Y(z)  = C$(z)x(0)  -t  [C$(z)z-'b  + D]X(z)  (5.78) 

where 

$(z)  = S{<l>(A)}  = ?{A*}  (5.76) 

Comparing  (5.78)  with  (5.77)  we  see  that 

<l>(z)  = (zl-A)“'z  = (l-Az“’)“’ 


(5.80) 
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Result  (5.80)  can  also  be  derived  directly  from  the  definition  of  the  transform 
for  A*  . 


J{A*}  = 

*=0 

= (zl-A)"*z  (5.81) 

Tlie  zero-state  response  specified  by  (5.78)  is 

Y(z)  = [C$(z)z-'b  + D)X(z)  (5.82) 

Hence,  the  transfer  function  matrix  for  a discrete  time  system  is 

H = C$(z)z-'B  + D (5.83) 

Proceeding  in  the  same  manner  as  in  Section  5.2  the  inverse  transform 
operator  $(z)  can  be  written  as 


4>(z)  = z(zl  - A)  * 


z adj(zl  - A) 
Izl- A| 


zB(z)  _ zB(z) 
Izl  - A|  p(z) 


(5.84) 


where  B(z)  and  p{z)  are  polynomials  in  z.  Replacing  s with  z in  (5.26)  and  (5.32) 
the  ratio  $(z)  can  be  evaluated  as  prescribed  by  (5.31)  througli  (5.35).  Repre- 
senting B(z)  andp(z),  respectively,  as 


B(z)  = Bqz*  '■  + Biz*-2  + ...  + B*_2Z  + 


and 


p{z)  = z*  + ajz*  ' + (72Z*^  ^ + ...  -F  a/(-iZ  + 
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Clearing  the  fraction  we  have 

(z*  +aiz*“*  +a2Z*~^  + ...  +ajz  + a*)l 

= (BqZ*-*  + Biz*’^  + ...  + B/t_2Z  + B*_iKzl  - A) 

= BqZ*  - BqAz*”*  + BiZ*"’  + ...  + Bk-iz  - B*_]A 
Equating  the  coefficients  of  like  powers  of  z gives 

Bo  = I ai  = "frA 

Bi  = BqA  + ail  02  = --jtr(BiA) 

B2  = BiA  + a2l  as  = - jtr(B2A) 


Bt-i  = B*_2A  + ajt_il  Ok-i  = -^tr(B*-2A) 

0 = B*-iA  + a*l  Ok  = -|tr(B*,lA)  (5.86) 

which  results  in  the  algorithms 

a,  = - jtr(B,_,A)  (i  = 1,2 k)  (5.87) 

B,  = Bi_iA  + a,l  0=  1, 2, (5.88) 

The  most  commonly  used  methods  to  determine  <*>(<:),  the  inverse  trans- 
form of  (5.84),  include  inversion  by  partial  fractions,  inversion  using  the  inver- 
sion integral,  or  inversion  by  long  division.  Each  of  these  techniques  requires 
addressing  the  individual  elements  of  the  matrix  $(z).  Consequently,  the  tech- 
niques are  laborious.  To  determine  the  signal  element  /(fc)  corresponding  to  each 
element  f{z)  in  $(z)  it  becomes  necessary  to  expand  f[z)  into  a power  series  in 
z"'.  The  expansion  coefficients  f(k)  are  the  amplitudes  of  the  discrete-time 
signal. 

Inversion  by  partial  fractions  involves  factoring  and  looking-up  in  an 
appropriate  table  the  discrete-time  function  corresponding  to  tlie  transform  to 
be  inverted.  Appendix  D contains  a typical  conversion  fable,  which  of  course. 
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can  be  expanded  considerably.  The  task  of  factoring  the  ratio  of  two  poly- 
nomials was  discussed  earlier.  The  associated  z-lormulation  takes  the  form  of 
equations  (5.36)  througli  (5.40). 

A more  elegant  technique,  which  also  involves  factoring,  is  use  of  the 
inversion  integral 

m)  = (£/(z)/-*c/z  (5.89) 

From  the  calculus  of  residues  the  evaluation  of  (5.89)  results  in 

f(k)  = Z!  residues  of  /(z)z*“'  at  its  poles 

within  circle  of  convergence  of  /(z).  (5.90) 

For  a pole  at  z = \/  of  multiplicity  aj  tire  residue  of/(z)z*”'  is  given  by 


Residue  f(z)z^~^  at 
pole  z = X;,  order  ay 


A third  method  of  inversion  of  the  % transform  applies  when  /(z)  is  a 
rational  function.  Both  the  numerator  and  denominator  can  be  expressed  as 
polynomials  in  z“*.  Using  long  division  the  numerator  is  divided  by  the  denomi- 
nator giving  a series  expression  in  powers  of  z“*.  The  numerical  values  of  th^ 
coefficients  are  the  f{k).  This  method  is  particularly  useful  when  tables  of  S 
transforms  or  the  inversion  integral  cannot  be  used.  However,  a general  expres- 
sion for  f{k),  as  can  be  obtained  by  the  other  two  methods,  is  not  readily 
achieved  through  the  long  division  method. 

To  illustrate  the  above  three  inversion  methods  consider  the  following 
example.  We  want  to  determine  the  inverse  transform  of  the  simple  complex 
function 


m = 


2z^  - 4z 
- 4z  + 3 


by  each  of  the  three  methods. 

(1)  Partial  Fractions.  Factoring /(z)  we  can  write 


Az)  = 


2z^  - 4z 
z^  - 4z  -H  3 
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4 


2z(z  - 2) 

(z-  IXz-  3) 


From  Appendix  D: 


f(k)  =1  + 3* 


(2)  Inversion  Formula.  The  function  has  simple  poles  at  z = 1 and 

z = 3.  There  is  no  pole  at  the  origin.  Thus  from  (5.90) 


2z^  - 4z  i._, 
f(k)  = - ^ z*  1 

z ~ i 

= 1+3* 


Z = 1 


2z^  - 4z 
z-  I 


,*-l 


z=3 


(3)  Long  Division.  Expressing  both  the  numerator  and  denominator  of  /(z)  as 
polynomials  in  z~  * we  have 


/(z)  = 


2-  4z-» 

1 - 4z->  + 3z-2 


The  long  division  gives 


2 + 4z->  + lOz-2  + 28z-3  + 82z-^  + ... 
1 - 4z->  + 3z-2  )2  - 4z-1 

2 - 8z-‘  + 6z-2 


/(O)  = 2 = 1 + 3° 
/(I)  = 4 = 1+3 

/(2)  = 10  = 1 + 32 

f(k)  =1+3* 


f 


* I 
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5.6  FINITE-STATE  MARKOV  SYSTEMS:  INPUT-FREE 

The  analysis  considered  thus  far  has  been  deterministic^  in  nature,  i.e.,  it  was 
assumed  that  the  system  states  at  time  t could  be  determined  with  certainty. 
This  certainty  was  a manifestation  of  exact  knowledge  of  the  characterization 
matrices,  the  state  at  some  initial  time  to,  and  the  input  applied  over  the  time 
interval  (to,t).  For  those  systems,  such  as  large  scale  systems,  where  the  charac- 
terization matrices  are  not  known  exactly,  it  can  be  expected  that  the  system 
state  and  output  can  only  be  described  statistically— even  though  the  input  and 
initial  state  are  known  exactly.  These  systems  are  called  stochastic  systems.  They 
are  characterized  in  terms  of  probabilities  of  being  in  a specific  state  at  some 
given  time,  and  in  terms  of  state  transition  probabilities  associated  with  each 
discrete  state.  If  the  discrete  states  that  the  system  can  assume  are  finite  in 
number  then  such  systems  are  finite-state  stochastic  systems. 

In  formulating  the  probabilistic  description  of  finite-state  stochastic 
systems  we  define  the  following  quantities:  Let  the  probability  that  the  system 
is  in  discrete  state  / at  time  (k  + 1)  be  Pj(k  + 1).  At  an  earlier  time  k the  system 
can  be  in  any  discrete  state  i with  probability  Pj{k).  For  each  state  i there 
corresponds  a state  transition  probability  that  the  system  will  transition  from 
state  i to  state  /.  We  stipulate  that  ipji  is  independent  of  the  past  history  of  the 
system,  i.e.,  it  is  independent  of  how  the  system  arrived  at  state  J from  state  i. 
(Systems  that  possess  this  property  are  known  as  Markov  systems. Thus,  the 
probability  that  the  system  will  be  in  state  / at  time  (k  + 1 ) is 

n 

Pj(k  + l)  = ll^^jiik)pjik)  j = \,2,...,n  (5.92) 

where  n is  the  total  number  of  discrete  states  the  system  can  assume.  (The  index 
n should  not  be  confused  with  our  previous  n used  in  establishing  the  order  of 
the  differential  equation  governing  the  continuous-time  system.)  Equation 
(5.92)  can  be  written  in  vector  form  as 

\ " p{k  + \)  = <Hk)p(k)  (5.93) 


1 . For  the  sake  of  completeness  our  previous  formulation  assumed  that  all  defined  quanti- 
ties were  readily  calculable.  It  must  be  recognized  that  this  is  true  only  for  simple  cases. 
' For  complex  systems  these  quantities  are  not  readily  ascertained  and  must  be  treated 

; statistically. 

I ■ 2.  For  a more  complete  discussion  of  Markov  chains  see,  for  example.  Feller.  W.,  An 

I Introduction  to  Probability  Theory  and  Its  Applications.  Vol.  1,  3rd  Edition.  John  Wiley 

& Sons,  New  York  (1968). 

\ 
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where  p is  an  n-dimensional  state-probability  vector  and  is  an  nXn  state 
transition  probability  matrix.  Clearly,  from  (5.92)  and  (5.93)  4>  is  of  the  form 

V’ii(^)  '^12W  •••  'fiinW 

^k)  = (5.94) 

where  must  satisfy  the  following  probability  conditions: 

iPfj  > 0 for  all  I,/  (5.95) 

^ ifiii  = 1 for  all/  (5.96) 

1 = 1 ' 

If  the  elements  vs,-,-  are  fixed  then  (5.93)  becomes 

p(A:  + l)  = 4>p(*)  (5.97) 

The  derivation  of  (5.97)  was  based  solely  on  probabilistic  reasoning;  inde- 
pendent of  the  linear,  time-invariant  conditions  of  Chapter  3.  However,  formula 
(5.97)  which  describes  a zero-input  probabilistic  system,  is  of  the  same  form  as 
state  equation  (5.50),  which  describes  a zero-input,  linear,  time-invariant  system: 

x()t  + l)  = 4>x(*:)  (5.50) 

Moreover,  the  system  described  by  (5.97)  need  only  be  a finite-state  Markov 
system.  The  likeness  of  (5.50)  and  (5.96)  suggests  that  a class  of  nonlinear 
systems,  where  the  states  are  represented  in  terms  of  probabilities,  is  governed 
by  linear  equations.  Therefore,  the  solution  of  (5.97)  could  be  obtained  by 
following  the  methods  of  Sections  3 and  5.  By  direct  recursion  (5.97)  becomes 

p(k)  = «l>*p(0)  (5.98) 

Hence,  for  input-free  Markov  systems  the  state  probabilities  are  completely 
“determined”  by  knowing  the  transition  probability  matrix  and  the  initial  state 
probability  vector.  The  evaluation  of  (5.98)  is  best  carried-out  in  the  z domain, 
wherein  the  S transform  gives 

zP(z)  - zp(0)  = <I>P(z) 

P(z)  = (I- 4>z-l)-lp(0)  (5.99) 
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We  demonstrate  the  use  of  (5.99)  through  simple  examples  below. 

In  general  Markov  systems  can  be  represented  graphically  by  state  dia- 
grams (signal  flow  graphs)  as  shown  in  Fig.  5.2.  The  nodes  are  the  states  i and 
the  directed  branches  are  the  state  transition  probabilities  ip/,--  The  states  can  be 
grouped  into  mutually  exclusive  state  sets,  whereby  the  system  can  change  from 
every  state  to  every  other  state  within  each  state  set.  Those  state  sets  that  once 
entered  can  never  be  left  are  called  ergodic  sets.  Their  associated  states  are 
recurrent  states.  Tliose  state  sets  that  once  left  can  never  again  be  entered  are 
called  transient  sets.  Generally,  Markov  systems  can  have  many  ergodic  and/or 
transient  state  sets  as  shown  in  Fig.  5.3.  States  1, 2,  and  5,  6 comprise  ergodic 
sets  whereas  states  3,4  are  a transient  set.  Each  set  is  mutually  exclusive  of  the 
other.  As  a minimum  a Markov  system  must  have  at  least  one  ergodic  set.  Such  a 
system  is  an  ergodic  system.  For  a system  having  more  than  one  ergodic  set  but 
no  transient  sets  the  state  sets  are  said  to  be  disjoint.  The  system  can  be  sepa- 
rated into  a number  of  independent  subsystems,  one  for  each  ergodic  set. 
Further,  if  an  ergodic  set  has  at  least  one  state  such  that  the  system  must  return 
to  this  state  periodically  the  ergodic  set  is  periodic,  otherwise  it  is  regular.  We 
shall  expand  the  above  statements  through  the  examples  below. 

Consider  the  simple  system  of  Fig.  5.4,  which  has  an  ergodic  set  but  no 
transient  set.  We  desire  to  evaluate  the  probability  that  this  system  will  transi- 
tion to  states  2 or  3,  given  that  the  system  is  in  state  1 at  time  ^ = 0.  The 
transition  matrix  for  the  system  of  Fig.  5.4  is 


Figure  S.4  Simple  ergodic  system . 
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where  the  transition  probabilities  satisfy  conditions  (5.95)  and  (5.96).  Given 
that  the  system  is  in  state  1 at  time  A:  = 0 the  initial  state  probability  vector  is 


From  (5.99) 

P(z)  = (I  - 4>z"‘)"'p(0) 


Evaluating  the  inverse  matrix  by  cofactors  we  have 
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Hence, 


P(z)  = 


1 - 7 * + -^z 

>l(2)“ 

1 

16 

P2U) 

-L,-l  - 1,-2 

11- 4)7-1 1 

2^  4^ 

Xz-2 

12 

As  was  mentioned  earlier  there  exists  for  small  systems  a variety  of  ways  in 
which  the  above  matrix  can  be  evaluated.  For  discussion  purposes  we  shall 
employ  the  methods  of  long  division  and  factoring.  The  method  of  long  division 
gives 


P(z)  = 


1 + .5625Z-1  + .4831Z-2  + .4379z"3  + .4141z^  + .4074 z"5  + ..T 
0 + .57-1  + .53817-2  + .54927-3  + .55457-^  + .5574z-3  + .. 

0 + 0 + .08337-2  + .13027-3  + .15487^  + .16767-3  + 


where  the  decimal  values  are  rounded  off  at  the  fourth  place.  From  (5.73)  the 
coefficients  of  z~^  give  the  probability  amplitudes  Pj{k)  directly;  thus  we  have 
the  discrete-time  histories  of  the  three  components  of  p{k): 


Pl(0)  = 1 
Pl(l)  = .5628 
Pl(2)  = .4831 
Pl(3)  = .4379 
Pl(4)  = .4141 
Pl(5)  = .4074 


P2(0)  = 0 

P2O)  = -5 

P2(2)  = .5381 
P2(3)  = .5492 
P2(4)  = .5546 
P2(5)  = .5574 


PjCO)  = 0 
PjCl)  = 0 
Pj(2)  = .0833 
P3(3)  = .1302 
P3(4)  = .1548 
P3(5)  = .1676 


The  above  state  probabilities  as  a function  of  discrete-time  are  shown  graphically 
in  Fig.  5.5. 

We  observe  that  as  k becomes  larger  the  probability  of  the  system  remain- 
ing in  state  1 decreases  sharply  and  approaches  a probability  level  of  0.4.  Knowl- 
edge of  the  initial  state  (k  = 0)  becomes  less  significant  with  time.  The  probabil- 
ity at  time  it  > 0 of  the  system  being  in  a state  other  than  state  1 is  0.6.  In  fact 
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the  system  is  in  state  2 with  probability  greater  than  0.5  and  in  state  3 with 
probability  0.16.  (The  fact  that  the  total  probability  exceeds  unity  is  attributed 
to  rounding-off  of  decimal  quantities  in  the  calculations.) 

As  an  alternative  method  for  evaluating  the  matrix  P(z)  of  our  example 
system  we  employ  the  method  of  partial  fractions.  The  system  determinant  can 
be  factored  as 


II- 


> - If'-'  ^ 


j_ 

48 


Z-3 


= z-3(z-  l)(z-  .522Xz-  .0398) 


thus 


1 - z * + 


P(z)  = 


P 

(z  - l)(z-  .522Xz-  .0398) 


j 

i 


■i 

I 

S 

i 
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The  inverse  of  P(z)  gives  the  components  of  the  state  probability  p(A:)  as 


Pit*)  = (l)*=o  + 1.410+ .2680(.522)*  + .3166(.0398)*]*>0 
P2(k)  = (.5494-  .0574(.522)*-  .4874(.0398)*]^>0 
Py(k)  = [.1819-  .3623(.522)*-  .1809(.0398)*]^>0 

A few  typical  values  of  the  components  of  p(*)  are 


Pi(0)  = 1 

P2(0)  = 0 

P3(0)  = 0 

p,(l)  = .5625 

P2O)  = .5 

P2O)  = 0 

Pj(2)  = .4835 

P2(2)  = .5331 

P3(2)  = .0835 

Pi  (3)  = .4481 

P2(3)  = .5413 

P3(3)  = .1314 

Pj(4)  = .4299 

P2(4)  = .5452 

P3(4)  = .1551 

II 

o 

P2(°°)  = .5494 

P3(-)  = .1819 

which  compare  favorably  with  the  results  achieved  by  the  method  of  long 
division. 

Analysis  by  partial  fractions  provides  for  system  insight  not  readily  attain- 
able through  analysis  by  long  division.  We  have  seen  by  both  methods  that  the 
components  of  the  state  probability  vector  p(A:)  must  not  exceed  unity.  Since 
\Pj{k)\  < 1 for  all  i and  k it  follows  from  the  method  of  partial  fractions  that 
none  of  the  eigenvalues  of  can  have  a magnitude  greater  than  unity . Thus  all 
roots  of  magnitude  less  than  unity  contribute  transient  terms  which  vanish  as 
k->-°°.  Since  every  Markov  system  must  have  at  least  one  ergodic  set  it  also 
follows  that  every  Markov  system  must  have  at  least  one  root  equal  to  unity.  In 
fact  the  characteristic  equation 


II-  = 0 


of  a Markov  system  will  have  as  many  roots  equal  to  unity  as  there  are  ergodic 
sets  in  the  system. 


1 
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5.7  TIME-VARYING  CONTINUOUS-TIME  SYSTEMS:  TIME  DOMAIN 
ANALYSIS 

For  time-varying  linear  systems  the  differential  equations  to  be  solved  are  j 

] 

x(0  = A(r)x(r)  + B(r)u(r) 
y(r)  = C(0x(0  + D(f)u(r) 

Tlie  homogeneous  form  for  the  dynamical  equation  is 

j 

Jjx(0  = A(/)xW  (5.100)  ^ 

ji 

We  begin  the  solution  to  (5.100)  by  assuming  that  the  components  of  x(/)  are  ’ 

linearly  related  to  the  components  of  x(ro).  Using  definition  (3.35)  this  relation-  J 

ship  for  zero  input  is  i 


Xi(t)  = (5.101) 

/ 

wherein  the  initial  conditions  are  established  as 

X/(/o)  = 0 for  i j 
X/(/o)  ^ 0 

Thus  (5.101)  represents  the  solution  for  an  arbitrary  state  by  superposition  of 
the  initial  conditions.  In  matrix  form  (5.101)  becomes 

x(f)  = <!>(?, /o)x(fo)  (5. 1 02) 

Returning  to  the  dynamical  equation  (5.100),  if  it  is  of  first  order  we  can 
write  in  scalar  form 

(ix 

^ = a(t)dt  (5.103) 


-] 
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Integrating  both  sides  of  (5.103)  from  /q  f gives 
xU)  _ 

-r 


In 


xUo) 


a(k)d% 


xU)  = jr(to)exp^  J a(|)r/A 


(5.104) 


By  (5.104)  it  is  inferred  that  for  first  order  systems  the  fundamental  matrix  is 


V5(f,fo)  = exp(  r fl(|)r/| 
■^'0  / 


(5.105) 


This  implies  that  for  the  initial  conditions  we  let 


'V/(0  = 'fil{tjQ)Xl{tQ) 

X/(fo)  = 0 for  /■  i 
X/(?o)  = 1 

Extending  (5 .105)  to  higher  order  systems  we  have 


<I>(/,/o)  = expj  f A(|)d| 

\-''o  y 


(5.106) 


Relation  (5.106)  is  valid  only  if  A(t)  and  tlie  integral  equation  commute,  i.e.,  if 


A(0  f A(0di  - f Aa)A(r)c/^  = 0 (5.107) 

-'to 


■'to 


Clearly,  for  the  case  where  A(f)  is  the  product  of  a constant  matrix  and  a scalar 
time  function,  such  as 


A(0  = /(t)A' 
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where 


i 


A(?)r/|  = A'  r md% 
to  ''to 


then  identity  (5.107)  holds.  Thus,  we  let 


<P(t.to)  = exp 


(5.108) 


In  general,  however,  it  is  not  possible  to  obtain  an  analytic  expression  for  the 
fundamental  matrix  of  a time-varying  linear  system.  It  is  primarily  of  conceptual 
interest. 

To  find  the  complete  solution  to  the  dynamical  equation  we  again  refer  to 
the  method  of  variation  of  the  parameter.  Proceeding  in  a manner  analogous  to 
that  of  the  previous  section  the  solution  x(/)  is  assumed  to  have  the  form 


x(0  = <!>(/, to)f(0 


(5.109) 


where  f(r)  is  to  be  determined.  The  time  derivative  of  (5.109)  is 
x(r)  = i(f,/o)f(0  + <l>(f.fo)f(0 


which,  when  substituted  into  (2.26),  gives 


[4>(f,ro)- A(/)4>(f.^o)lf(/)  + <h(r,/o)f(/)  = B(/)u(0 

The  expression  in  the  bracket  is  the  homogenous  equation  and  is  identically 
zero,  leaving 


<I>(f,/o)nf)  = B(0u(t)  (5.110) 

Multiplying  both  sides  of  (5.110)  from  the  left  by  the  inverse  of  <l>(r.ro)  ^nd 
integrating  we  obtain  the  expression  for  f(/): 


f(0  = 


J 4>-'(?To)B(?)u(J)r/? 
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5.8  TIME-VARYING  DISCRETE  TIME  SYSTEMS:  TIME  DOMAIN 
ANALYSIS 

The  normal  form  of  the  equations  governing  a linear  time-varying  discrete-time 
system  is 

x(k  + 1)  = A(k)x{k)  + B(^)u(A:)  (5.1 14) 

y(A:)  = C(it)x(/t)  + D(/t)u(it)  (5.  II 5) 

As  shown  before  the  solution  to  (5.1 14)  can  be  found  directly  by  recursion: 
x(l)  = A(0)x(0)  + B(0)u(0) 
x(2)  = A(l)x(l)  + B(l)u(l) 

= A(l)A(0)x(0)  + A(l)B(0)u(0)  + B(l)u(l) 
(5.116) 


For  k>S.we  define 

k-l 

^(.k,i)  = n A(0  = A(fc  - 1 )A(*  - 2) . . . A(E  + 1 )A(6)  (5.117) 

l=S 

and 


Hk,k)  = \ (5.118) 

Thus  the  equation  of  state  for  a non-stationary,  discrete-time  system  can  be 
written  as 

x(k)  = <P(k,0)x(0)  + Z]4>(it,e+l)B(C)u(C)  (5.119) 

v=0 


Substituting  (5.1 19)  into  (5.11 5)  the  output  expression  becomes 


Y(k)  = C(k) 


k-l 


<t>(k,o)x(o)  + E<p(k,e+i)B(e)u(e) 

8=0 


+ D(k)u(k)  (5.120) 


For  zero  initial  conditions  the  system  output  is 


k-l 


Y(k)  = Z;C(A:)4)(A:,C  + I)B(«)u(C) 
8=0 


(5.121) 


X 
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From  our  previous  definition  of  the  transfer  function  matrix  //,  where 

y = Hu 

the  weighting  sequence  matrix  H for  non-stationary  systems  has  the  form,  from 
(5.121), 

H(A:,A:-e)  = C(A:)4>(yt,C  + 1)8(6)  (5.122) 

From  (5.1 17),  for  A:  > 8, 

Ic-I  k-\  m-\ 

nA(r)  = HaO)!!  A(n)  (5.123) 

r = C i=m  n=C 

Thus,  the  expression  for  <I»  becomes 

$(/t,C)  = 4>(A:,w)4>(m,C)  (5.124) 

For  k = 9.  equation  (5.124)  gives 

$(A:,A:)  = 4>(^,m)'I>(/n,C)  = I 

For  k<S.  we  define 

^>(*,6)  = A~1(*)A"1(^  + 1)-A“1(6- 1)  (5.125) 

Provided  the  inverses  of  lk{k)  exist  we  conclude  that 

^(m,k)  = ^~^(k.m)  (5.126) 

Hence,  the  weighting  sequence  matrix  (5.122)  can  be  written  as  the  product 

H(*,*-6)  = H(A:)H(6)  (5.127) 

where 

H(fe)  = C(fc)<h(fc,0)  (5.128) 

H(6)  = <KO,8  + l)B(6)  (5.129) 


! 


T” 


r 

6 

Controllability,  Observabiiity 
and  Stabiiity 


6.1  NOTATION 

. In  certain  analytical  problems  involving  the  scalar  product  of  two  complex 

\ vectors  the  order  of  the  vectors  in  the  scalar  product  is  important.  To  highlight 

I ‘ this  importance  and  track  the  ordered  vector  pairs  we  adopt  the  Dirac  bra-ket 

. (or  <-l->)  vector  notation.  However,  as  the  need  arises  for  clarity  or  brevity  we 

will  occasionally  revert  to  the  original  notation. 

■ In  general  for  any  two  complex  vectors  x and  y belonging  to  the  same 

; vector  space  their  scalar  product  may  be  such  that 

■. 

* X -y  y • X 

‘ » or  in  Dirac  notation 

<x|y>  ^ <ylx> 
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Since  the  absolute  value  of  the  scalar  product  is  independent  of  the  order  of  the 
factors  we  can  write 

(x|y>  = <y|x>* 

<x|Xy>  = X<x|y> 

<Xxly>  = X*(xly>  (6.1) 

where  X is  a constant,  not  necessarily  real,  and  the  asterisk  denotes  the  complex 
conjugate. 

Consider  an  nXn  Hermitian  operator  A having  distinct  eigenvalues 
Xi,X2,...,X„.  Let  the  associated  ket  (or  right)  eigenvectors  be  |e]>,|e2>,...,|e„>. 
The  corresponding  eigenequations  can  be  written  as 

A|e,->  = X,|e,>  / = 1,2, ...,w  (6.2) 

where  each  |e,>  is  defined  by  the  normalized  basis  vectors  (*c,-,2e,,...,"c,),  i.e., 

|s/>  = (>e,-,2e,-,...,"e,)  (6.3) 

It  can  be  shown  that  the  eigenvectors  |ei>,|e2>,...,|e„>  are  linearly  independent 
and  therefore,  also  form  a basis  for  the  «-dimensional  space.  Hence  any  vector  in 
the  space  spanned  by  |e,->  (/=  l,2,...,n)  can  be  written  in  terms  of  the  eigen- 
vectors. For  the  right  vector  |x>  we  can  therefore  write 

n 

lx>  = H Xi  |e,->  (6.4) 

1 = 1 

The  respective  bra  or  reciprocal  basis  vectors  corresponding  to  those  of 
(6.3)  are  <ej|,(e2l,....<e„l.  Together  the  two  sets  of  basis  vectors  satisfy  the 
orthonormality  relation 

(e/iey)  = hij  (6.5) 

For  any  two  specific  vectors  |e,->  and  <e,  | the  term 

le,-Xe,| 
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is  called  a dyad.  Provided  they  have  been  normalized  to  satisfy  condition  (6.5) 
the  sum  of  the  dyads  is  the  identity  matrix,  i.e., 

Hle.Xe/l  = I (6-6) 

i = \ 

which  is  the  closure  relation. 

Corresponding  to  the  right  vector  (6.4)  we  can  write  for  the  left  vector 

n 

<x|  = Z!x,<e,l  (6-7) 

1 = 1 

Using  identity  (6.6)  equations  (6.4)  and  (6.7)  can  respectively  be  written  as 

n 

lx)  = H le,Xe/lx>  (6.8) 

1 = 1 

n 

(x|  = Z!  (x|e,Xe,|  (6.9) 

1 = 1 

The  scalar  product  of  any  eigenvector  <e/l  and  |x>  becomes 

n 

<e,lx>  = 2Z<ey|e,)x,-  (6.10) 

Essentially,  the  dyads  of  (6.6)  are  n X n matrices  with  diagonal  elements 
Each  term  in  the  sum  is  a projecUon.  Therefore,  the  eigenvalue  problem 
as  formulated  by  (6.2)  can  be  stated  as  one  whereby  for  a given  Hermitian 
matrix  operator  A the  associated  n-dimensional  vector  space  is  decomposed  into 
a complete  set  of  orthonormal  vectors  such  that  A is  a linear  combination  of  the 
projections  (dyads).  This  is  clearly  seen  by  expanding  (6.2),  i.e.,  multiplying 
both  sides  from  the  right  by  <e,j  and  summing: 

Ale,)  = X,|e,> 

n 

a2Z  |e,Xe,  l = Z]  X/ie,Xe,j 
1=1  1=1 


4 
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n 

A = E\,le,><e,|  (6.11) 

/ = ! 

Since  any  matrix  operator  A can  be  written  as  its  identity 

A = lA  (6.12) 

it  is  apparent  from  (6.6)  that  matrix  operators  can  be  expanded  in  terms  of  their 
respective  eigenvectors.  Thus  the  following  identities  are  possible: 


n n 


A = AZ]|e/Xe/l  = Z!x/|e/><e/| 
i-\  /=1 

(6.13) 

n 

A"1  = E^le/Xe,! 
1 = 1 N 

(6.14) 

n 

|x>  = Z]  |e/Xe/|x> 
/ = 1 

(6.15) 

n 

(si  - A)  = Z]  (s  - X/)|e/Xe/| 
/=1 

(6.16) 

n 

(si  A)  1 H |e,Xe,| 

i=l  (s-X/) 

(6.17) 

n 

= Z]e^'^|e/Xej| 
; = 1 

(6.18) 

n 

/(A)  = E/(X,-)|e,Xe,| 

(6.19) 

1=1 


6.2  SPECTRAL  DECOMPOSITION 

The  state  equations  in  standard  form  were  shown  to  be 

|x>  = A|x>  + B|u> 

|V>  = C|x>  + D|u> 


1 


I- 


• Jt'* 


t. 
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Their  respective  time-dependent  solutions  are 


lx)  = <«/) 


lv>  = C<Wt) 


|x(0)>  + f e-*^B|u(?)>df 
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(6.20) 


lx(0)>  + I e-''‘B|u(?)>df 
0 


+ D|u(r)>  (6.21) 


By  applying  identities  (6.1  5)  and  (6.18)  to  solutions  (6.20)  and  (6.21 ) we  obtain 
the  following  spectral  decompositions : ’ 


V'  " C' 

lx)  = Z!e''''le,><e,|x(0)>  /'■''‘^>ie,XB%,|u(|)>d?  (6.22) 

. = 1 

n n 

ly>  = 11  e^''cie,Xe,lx(0)>  + E 
/=!  (=1 

where  B^  is  the  conjugate  transpose  of  B.  The  scalar  product 
<e,|Bu(|)>  = <B^e,ju(?)> 


I'l 

e'"'*'-‘*C|e,XB^e,|u(?)>d?  + D|u(r)>  (6.23) 

*^0 


applies. 

From  (6.22)  and  (6.23)  two  observations  are  readily  apparent:  (1)  the 
vectors  <B^e,  | may  be  regarded  as  “weights”  which  determine  the  magnitude  of 
the  effect  of  the  input  on  the  change  in  state  in  the  ie,->  “direction,”  and  (2)  the 
vector-valued,  time-dependent  quantities  e^'^le,)  can  be  interpreted  as  the 
system  modes. 


6.3  CONTROLLABILITY 

The  system  theoretic  concept  of  controllability  arises  from  the  following  two- 
point  boundary  valve  problem.  Given  an  initial  state  |x(0)>  at  time  zero  and  a 
final  state  |x>  ¥=  |x(0)>,  determine  whether  it  is  possible  to  find  a time  t and  an 
input  |u>  which  take  the  system  from  its  initial  state  at  time  zero  to  its  final 


1.  Notationally  x=  ix>. 


vv 


1', 


108  AN  INTRODUCTION  TO  THE  THEORY  OF  LINEAR  SYSTEMS 

State  at  time  t,  where  t is  finite.  To  formalize  the  definition  of  controllability  we 
establish  for  the  system  representation 

lx>  = Alx>  + Blu> 

|y>  = C|x>  + D|u>  (6.24) 

and  stipulate  that  at  some  initial  time  zero  the  system  is  in  an  allowable  initial 
state  |x(0)>.  The  system  representation  (6.24)  is  defined^  to  be  completely 
controllable  if  there  exists  a finite  time  / > 0 and  a real  input  \u)  defined  on  the 
time  interval  (0,t)  such  that 


|x>  = e 


Af 


B|u(S)>c/| 


The  necessary  and  sufficient  conditions  under  which  the  system  represen- 
tation is  controllable  are  readily  deducible  from  (6.22),  the  spectral  decomposi- 
tion of  the  state.  These  conditions  are  presented  below  (without  proof)  in  the 
form  of  the  following  theorems:^ 

Theorem  6.1.  Let  the  matrix  A in  (6.24)  have  distinct  eigenvalues.  Representa- 
tion (6.24)  is  completely  controllable  if  and  only  if  for  all  n vectors  of  (6.22)  the 
condition 


<B^e,-|  0 (/•=  l,2,...,n)  (6.25) 

is  satisfied.  (Be,-|  = 0 for  any  i implies  that  the  mode  e,-  cannot  be  excited.  Hence 
the  representation  cannot  be  completely  controlled. 

Theorem  6.2.  Let  A and  B in  (6.24)  be  nXn  and  nXm  matrices,  respectively. 
Representation  (6.24)  is  completely  controllable  if  and  only  if  the  rows  of  the 
matrix 


are  linearly  independent  on  the  time  interval  (0,  r). 


1.  See  Zadeh  and  Polak,  System  Theory,  p.  244.  McGraw-Hill,  New  York,  1969. 

2.  op.  cit. 


\ 
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Theorem  6.3.  Let  A and  B in  (6.24)  he  nXn  and  nXm  matrices,  respectively. 
Representation  (6.24)  is  completely  controllable  if  and  only  if  the  nXnm  con- 
trollability matrix  Q^. , where 

Q(.  = [B  AB  A^B  ...  A"-'B)  (6.26) 

has  rank  n. 

As  a test  of  the  above  conditions  consider  the  simple  system  of  Fig.  6.1 . 
We  desire  to  establish  the  controllability  of  the  system  representation.  By  inspec- 
tion we  have  the  simultaneous  equations 

it  = ATI  + 4x2  ^ “ 

X2  = 5x2  u 
y = XI  - X2 


which  can  be  written  in  matrix  form  as 


Figure  6.1 


1 
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From  (6.26)  the  controllability  matrix  Q^.  is  the  2 X 2 matrix 


Q< 


[B  AB) 


1 5 
1 5 


Clearly,  the  columns  B and  AB  of  Q^.  are  linearly  dependent.  Therefore  Q,;.  doer 
not  have  rank  n (=2).  Hence,  the  system  representation  for  tire  above  example  is 
not  completely  controllable.  However,  with  a slight  modification  to  A (i.e., 
changing  the  amplification  of  state  component  jcj).  such  that 


A = 


I 

0 


5 

5 


the  columns  of  Q,.  become  linearly  independent.  As  a result  the  system  represen- 
tation for  the  above  example  is  now  completely  controllable.  We  have 

6’ 

5 


6.4  OBSERVABILITY 

Changes  in  system  state  as  discussed  above  are  not  directly  observable.  Usually 
what  is  observed  is  the  input  (u)  and  its  detectable  effect  on  the  system  output 
lv>.  This  physical  realization  leads  to  the  following  definition  of  observability. 
We  establish  the  system  representation  as  (6.24).  Let  the  system  be  in  an  allow- 
able initial  state  |x(0)>  at  time  zero.  And  let  the  input  at  time  zero  be  |u>  =0. 
The  system  representation  (6.24)  is  defined^  to  be  completely  observable  if  for 
every  initial  state  at  time  zero  there  exists  a finite  time  t such  that  knowledge  of 
the  output  |y>  over  the  time  interval  (0,/)is  sufficient  to  determine  the  initial 
state  lx(0)>. 

A set  of  necessary  and  sufficient  conditions  under  which  the  system 
representation  (6.24)  is  observable  is  seen  from  (6.23),  the  output  spectral 
decomposition.  These  conditions  are  presented  below  (without  prooO  in  the 
form  of  the  following  theorems:^ 


1.  op.  cit. 

2.  op.  cit. 


- 


1 
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Theorem  6.4.  Let  the  matrix  A in  (6.24)  have  distinct  eigenvalues.  Representa- 
tion (6.24)  is  completely  observable  if  and  only  if  for  the  n vectors  oflb.li)  the 
condition 

Cle,)  / 0 (6.27) 

is  satisfied.  Obviously,  if  C|e,)  = 0 for  some  /,  then  initial  states  of  the  form 
|x(0)>  = ale,)  would  give  zero  outputs  for  zero  input  and  these  states  would  be 
indistinguishable  from  the  zero-state  |x)  = 0.  (Note  that  in  concert  with  (6.25) 
the  system  representation  is  completely  controllable  if  and  only  if  every  mode  of 
its  dual  representation  is  observable.) 

Theorem  6.5.  Let  the  matrices  A,  B,  C and  D in  (6.24)  be,  respectively,  n Xn, 
nXm,  kXn  and  kXm.  Representation  (6.24)  is  completely  observable  if  and 
only  if  the  observability  matrix  Qq  , where 

= [C^Afc^..(A'’-')'fc^l  (6.28) 

has  rank  n.  (The  dagger  denotes  the  conjugate  transpose.) 

As  a test  of  the  above  criteria  we  return  to  the  simple  system  of  Fig.  6.1 . 
The  matrix  C is 

C = 11  -1] 

By  (6.28)  the  observability  matrix  Qq  is 


Clearly  the  columns  of  Qq  are  linearly  dependent  and  the  rank  of  Qq  is  less  than 
n (=2).  Hence  the  representation  for  the  system  of  Fig.  6.1  is  not  completely 
observable.  However,  with  a slight  modification  of  C (i.e.,  changing  the  amplifi- 
cation of  the  output  X2)  such  that 

C = (1  I) 

the  columns  of  Qo  are  made  linearly  independent  and  the  system  representation 
is  made  completely  observable. 
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6.5  STABILITY-UNFORCED  SYSTEM 

Tlie  system  theoretic  concepts  of'  controllability  and  observability  are  direct 
results  of  the  spectral  decomposition  of  the  state  and  output  vectors.  Both 
concepts  appeal  to  our  intuition.  A third  concept,  somewhat  related  to  the  other 
two  and  also  fundamental  to  the  qualitative  analysis  of  dynamical  systems,  is  the 
concept  of  system  stability . In  defining  stability  we  refer  to  the  system  state  and 
its  behavior  with  time.  By  definition  an  equilibrium  state  is  one  whereby 

lx)  = 0 

|x>  = B 

From  Newton’s  laws  of  motion  an  unforced  system  that  is  initially  in  an  equilib- 
rium state  will  remain  in  that  state  indefinitely  unless  acted  upon  by  an  external 
force,  after  which  time  one  of  several  mutually  exclusive  things  can  happen. 
Consider  the  external  force  to  be  an  impulse.  Tlien,  (1)  the  system  can  be 
displaced  from  equilibrium  and,  by  internal  properties,  returned  to  equilibrium 
within  a small  time  interval.  In  this  case  the  state  is  said  to  be  stable.  (2)  The 
state  can  be  displaced  a finite  “distance”  from  equilibrium  and  remain  at  the 
displaced  position  for  all  time.  In  this  case  the  state  is  said  to  be  unstable  but 
bounded.  Or  (3)  the  state  vector  can  grow  indefinitely  with  time  in  which  case 
the  state  is  both  unstable  and  unbounded. 

The  bounded  aspects  of  system  or  state  stability  can  be  refined  mathe- 
matically by  introducing  the  idea  of  a neigliborhood  surrounding  an  equilibrium 
state.  What  is  intended  by  a neigliborhood  in  this  instance  is  a finite,  fixed, 
arbitrarily  small  displacement  e in  the  Fq  t'f'e  plane  which  surrounds  the  end- 
point of  the  equilibrium  state  vector  (Fig.  6.2).  We  assume  that  the  system 
output  is  the  state  and  that  the  equilibrium  state  in  question  is  the  zero-state 
lx>  = 0 = 0.  Accordingly,  we  define  the  concept  of  stability  of  the  zero-state  or 
zero-input  stability.  Roughly,  the  zero-state  lx>  = 0 is  stable  if  for  every  initial 


Figure  6.2  Neighborhood  surrounding  an  equilibrium  state. 
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State  |x(0)>  sufficiently  close  to  zero  the  corresponding /ret'  motion  x(fo.x(0),/) 
remains  close  to  zero  for  all  / > to  precisely,  we  define  the 

zero-state  |x>  = 0 to  be  stable  in  the  sense  of  Lyapunov  if  for  any  tp  and  any 
e > 0 there  exists  a 6 > 0,  dependent  on  e and  Zp,  such  that 

|x(0)l  < 6 =►  Ixl  < e (6.29) 

Condition  (6.29)  implies  that  the  free  motion  trajectory  x(fp,x(0),f)  remains  in 
the  cylinder  of  radius  e for  all  time. 

Additionally,  we  require  that  the  system  response  eventually  go  to  zero. 
Our  above  definition  of  stability  in  the  sense  of  Lyapunov  does  not  cover  this 
requirement.  Therefore,  we  define  the  stability  of  the  zero-state  |x>  = 0 to  be 
asymptotically  stable  if  (a)  it  is  stable  in  the  sense  of  Lyapunov,  and  {h)  for  any 
Zp  |x(0)>  is  sufficiently  close  to  0 such  that  the  free  motion  trajectory 
x(zp,x(0),  Z) ->  0 as  z-»°°. 

We  have  seen  previously  that  for  the  unforced  system  the  solution  to  the 
dynamic  state  equation  is 


lx>  = 4>(z)|x(0)> 

<1>(Z)  = e'^'  (6.30) 

Thus  it  is  possible  to  represent  the  state  at  a time  subsequent  to  Zp  by  a linear 
transformation  (involving  the  fundamental  matrix  4>)  of  the  initial  state.  There- 
fore. for  such  systems  it  should  be  possible  to  determine  the  conditions  for 
stability  which  depend  only  on  4>.  Accordingly,  in  support  of  the  definitions  of 
zero-state  stability  the  following  theorems^  apply: 

Ok 


1.  The  above  theorems  arc  presented  without  proof.  I'or  a demonstration  of  their  respective 
proofs  see,  for  example,  Zadeh  and  Desoer,  Linear  Systems  Theory,  pp.  .179-391.  pp. 
498-503.  McTiraw-llill.  New  York.  1963. 

2.  Notationally  lx>  = x and  the  two  notations  will  be  used  interchangeably  where  brevity  is 
required. 


- ^ * 
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Theorem  6.0.  The  system  is  stable  in  the  sense  of  Lyapunov  implies  (and  is 
implied  by)  there  exists  a finite  constant  K which  may  depend  on  to  such  that 

l<I>(f.ro)l  < A'  (forall />/o)  (6.31) 

Theorem  6. 7.  The  system  is  asymptotically  stable  if 

(a)  |4>(/.?0)l  < (forall/>fo) 

and(b)  lim  |<I«(/,ro)l  = 0 (for  all /q)  (6.32) 


Theorem  6.8.  The  system  is  stable  in  the  sense  of  Lyapunov  implies  (and  is 
implied  by) 

(a)  All  the  eigenvalues  of  the  constant  matrix  A have  non-real  parts,  and 

(b)  those  eigenvalues  of  A that  lie  on  the  imaginary  axis  are  simple  zeros  of 
the  minimum  polynomial  of  A. 


Theorem  6. 9.  The  system  is  asymptotically  stable  if  and  only  if  all  the  eigen- 
values of  A have  negative  real  parts. 

By  theorems  6.7  and  6.9  above  we  assert  that  for  fixed  systems  described 
by  (6.30)  asymptotic  stability  implies  that 

|x(/o-x(0),0l  ->■  0 

in  an  exponential  manner.  We  therefore  specify  that 

|x(fo,x(0),01  < (6.33) 

where  C is  a constant  and  X is  a positive  number. 

We  can  be  more  specific  regarding  the  above  definitions  and  theorems 
through  a simple  example.  Let  the  matrix  operator  A for  the  unforced  system 
governed  by  (6.30)  have  distinct  eigenvalues  X1.X2 where  each  eigen- 

value has  the  complex  form 


X|  - O]  + /oj| 
X2  = 02  + /W2 

Xfi  “ 0/1 


r 


] 
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nie  real  parts  of  each  eigenvalue  can  be  ordered  as  0|  >02->.  ■>o„.  from 
equation  (5.46)  the  state  transition  matrix  >1'  can  be  writlen  as 

" '"P  r I 

* ^ IP 

1 = 1 f = l ('^-1)' 

where  the  A„  arc  matrix  coefficients  of  the  polynomial  expansion  and  nip  is  the 
root  multiplicity  or  degeneracy  of  the  eigenvalues.  I-or  the  unforced  sysleni  we 
consider  three  situations;  (I ) o,-  < 0.  (2)  o,  > 0.  and  (.^)  a,  = 0.  In  each  case  we 
refei  to  equation  (5.46). 

Case  ( 1 ):  o/  < 0. 

We  see  that  for  r>to  the  absolute  value  of  <l>  remains  finite.  Hie  condi- 
tion of  Theorem  6.6  is  satisfied.  Furtlier, 

lim<l>(/o.t)  = 0 

thereby  also  satisfying  the  conditions  of  Theorem  6.7.  It  necessarily  follows  ihai 
Theorem  6.8  and  6.9  are  also  satisfied.  Clearly  then  for  the  case  where  the 
eigenvalues  have  negative  real  parts  the  state  vector  lx>-»  0 as  r -►  «>.  The  system 
representation  is  asymptotically  stable. 

Case  (2):  Oj  > 0. 

The  time  limiting  value  of  the  transition  matrix  <l>  in  this  case  is 
lim  <J>(/{).r)  = °° 

Therefore  the  state  vector  as  t-*°°.  Hence,  for  the  case  where  the 

system  eigenvalues  have  positive  real  parts  the  system  representation  is  unstable. 

Case  (3):  Oj  = 0. 

When  the  system  eigenvalues  are  purely  imaginary  two  situatir  ns  arise: 

(a)  The  system  has  simple  roots,  in  which  case  the  A;.,  are  constant  matrices 
and  <I>  remains  finite.  Thus  in  accordance  with  Tlieorem  6.6  the  system 
representation  is  stable. 

(b)  The  system  has  multiple  roots,  in  which  case  the  A„  are  polynomials  in  t 
with  matrix  coefficients.  Clearly 

lint  4>(ro-H  = °° 


Hence,  the  system  representation  is  unstable. 
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6.6  STABILITY-FORCED  SYSTEM 

Recalling  the  state  dynamical  equation  and  its  solution  we  have  for  the  forced 
system 

|x>  = A|x>  + B|u> 


ix>  = m 


|x(0)>+  f <!>(?) Blu(?)>cfS 


For  discussion  purposes  we  specify  that  the  system  is  initially  at  rest.  Let  the 
initial  state  at  time  zero  be  the  zero-state,  i.e.,  lx(0))  = 0,  after  which  time  the 
system  is  perturbed  by  an  impulse.  We  examine  the  system  response  as  it  relates 
to  the  concept  of  stability.  For  this  situation  the  solution  to  the  dynamical 
equation  becomes 


lx>  = I <!>(/- |)B|u(?))t/| 
^0 


= f Giuam 

•^0 


where 

G = 4>{t-?)B 

The  elements  of  the  matrix  G are  of  the  form 


(6.34) 


(6.35) 


k=\ 


(6.36) 


* 


Since  the  vector  |u>  is  a column  matrix  the  relationship  between  the  elements  of 
the  integrand  and  the  state  vector  is  established  as 


I 
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Expressing  the  elemental  input  as  an  impulse 
w/  = 5{/-|) 

Ui  = 0 for  / /■ 

the  components  of  the  state  vector  become 

Xi  = gii  (/■=  l,2,...,n) 

• Thus  G can  be  interpreted  as  the  matrix  of  impulse  responses  of  the  state.  For 

such  a system  we  define  stability  as  follows:  A forced  system  is  stable  relative  to 
a set  of  bounded  inputs  if  and  only  if  the  state  is  bounded  for  all  inputs  |u>(t)  in 
' the  set  for  all  t>  to-  Accordingly  we  require  that 

\x(t)\  < K <«•  (6.38) 

i 

where  A is  a finite  constant.  In  support  of  the  above  definition  the  stability 
criterion  for  a forced  system  is  established  by  the  following  theorem:* 

Theorem  6.10.  A forced  system  is  stable  with  respect  to  a set  of  bounded  inputs 
j . if  and  only  if 

f < Ay  < - (6.39) 

''to 

for  every  to  and  all  / > ?o- 

I 

I 

We  illustrate  through  a simple  example  the  concept  of  stability  as  it  applies 
to  a forced  system.  Consider  the  representation  of  the  system  in  Fig.  6.2.  The 
[ system  matrices  are 


'l  4' 

1 

A = 

0 5 

B = 

1 

C = 

[1  -1 

1 0 = 

[1| 

\ 

I 


I . Op.  cit. 
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The  State  transition  matrix  is 


<}>  = e*' 


= I + A/  + 


(A/)^ 

2! 


+ ... 


‘l  o' 

■ 1 4‘ 

■ 1 24' 

+ 

t + 

0 1 

0 5 

0 25 

We  desire  to  examine  in  closed  form  the  time  dependency  of  the  fundamental 
matrix.  This  can  readily  be  accomplished  for  simple  systems  by  converting  to  the 
frequency  domain  then  back  to  the  time  domain.  Thus 

£{<1.(0}  = [sl-A]-l 

s-1  -4  ]■* 

0 s-5 

1 4 

s - 1 (s  - 1 X*  “ 5) 


Converting  back  to  the  time  domain  the  fundamental  matrix  becomes 
4>(f) 


0 


p5r 


Hence  the  product  G = 4>B  is 


It  is  readily  apparent  that  for  the  system  of  Fig.  6.1  the  positive  nature  of  the 
elements  of  <I>  and  B will  cause  all  the  non-zero  terms  to  grow  beyond  bounds  as 
t becomes  infinite.  Therefore,  the  system  representation  for  the  example  chosen, 
under  both  forced  and  unforced  conditions,  is  unstable.  This  conclusion  should 
be  no  surprise.  Earlier  it  was  shown  that  the  representation  for  the  system  of 
Fig.  6.1  was  not  completely  controllable  and  not  completely  observable.  How- 
ever, with  slight  modifications  we  were  able  to  render  the  representation  as 
completely  controllable  and  completely  observable.  We  now  logically  inquire  as 
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to  whether  or  not  similar  modifications  can  be  made  in  the  case  of  system 
stability -under  forced  and  unforced  conditions. 

In  the  case  of  the  unforced  system  the  conditions  for  stability  are  manifest 
in  equations  (6.31)  through  (6.33).  Equations  (6.38)  and  (6.39)  represent  the 
conditions  for  stability  of  the  forced  system.  All  of  these  conditions  can  be 
satisfied  in  a variety  of  ways;  the  objective  being  to  maintain  a finite  state  vector 
for  bounded  inputs  for  all  f>/o-  An  unstable  representation  of  an  unforced 
system  does  not  necessarily  mean  an  unstable  representation  of  that  same  system 
when  an  input  is  applied.  The  terms  giving  rise  to  the  instability  may  not  be 
excited  by  the  input.  Nor  does  it  necessarily  follow  that  when  the  conditions  for 
stability  are  met  then  the  system  is  completely  observable  and/or  completely 
controllable.  Similarly,  a stable  representation  of  an  unforced  system  does  not 
necessarily  imply  a stable  representation  of  the  forced  system.  We  demonstrate 
these  important  facts  through  a simple  example.  Referring  to  the  system  of  Fig. 
6.1 , we  render  the  system  stable  by  changing  the  representation  to  that  shown  in 
Fig.  6.4.  Essentially,  the  changes  are  in  the  magnitudes  of  signal  amplification. 
The  modified  system  matrices  become  * 


’ 1 

1 ' 

1 ' 

A = 

0 

-1 

B = 

-2. 

C = 

(1  1] 

D = 

HI 

The  corresponding  state  transition  matrix  and  impulse  response  matrix  G are, 
respectively , 


e'  j(e'  - e 0 

e ^ 

G = 

0 e~‘ 

_2,-t 

Figure  6.4  Sy,stcm  wherein  the  input  does  not  excite  the  unstable  mode. 

1.  After  Schwarz  and  Friedland,  Linear  Systems,  p.  381 , McGraw-Hill.  New  York,  1965. 


120 


AN  INTRODUCTION  TO  THE  THEORY  OF  LINEAR  SYSTEMS 


Due  to  the  positive  nature  of  the  elements  | and  1^12  we  have 

lint  |<l>|  = °o 
/ -*<» 

Thus,  the  representation  for  the  system  of  Fig.  6.4  is  unstable  when  no  input  is 
applied.  On  the  other  hand  the  impulse  response  contains  the  diminishing  terms 
e~‘  in  both  elements^!  and ^2-  Therefore  the  integrals 


both  remain  finite  for  all  finite  Consequently,  the  representation  of  the 

system  of  Fig.  6.4  is  stable  when  an  input  is  applied.  This  apparent  paradox  is 
resolved  when  one  examines  the  controllability  and  observability  matrices  for 
the  system,  under  both  forced  and  unforced  conditions: 


Unforced  system 


u = 0 
B = 0 
= 0 


Conclusions; 

1.  unstable 

2.  not  completely  controllable 

3.  completely  observable 


Forced  system 


1 . stable 

2.  not  completely  controllable 

3.  completely  observable 


Thus,  to  resolve  the  above  paradox  it  is  concluded  that  the  system  input  does 
not  excite  the  unstable  mode.  Botli  modes,  however,  are  observable.  The  un- 
stable mode  is  observed  when  no  input  is  applied.  The  stable  mode  is  observed 
when  the  input  is  applied. 


6.7  COMMENT 

The  central  idea  of  the  past  discussion  has  been  system  representation  in  state- 
space.  The  algebra  of  rational  functions  (polynomials),  linear  vector  spaces,  and 
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the  methods  of  Fourier  and  Laplace  transforms  were  the  principal  analytical 
tools.  It  was  assumed  that  knowledge  of  the  system  matrix  operators  and  vari- 
ables was  complete.  However,  it  must  be  recognized  that  for  large  systems  this 
assumption  is  short-lived.  For  large  systems  complete  identification  of  all  the 
variables  and  their  operator  elements  is  not  practical,  and  therein  lies  the  prob- 
lem of  applying  the  linear  theory  to  practical  problems  dealing  with  systems  of 
significant  size.  Consequently,  a deterministic  analysis  for  such  systems  gives 
way  to  the  more  realizable  probabilistic  or  statistical  analysis.  Accordingly,  the 
reader  is  referred  to  the  methods  of  Markov  chains,  diakoptics,  sparse  matrices, 
fuzzy  sets,  statistical  mechanics,  etc.,  which  bear  on  this  problem. 


7 

Statistical  Systems-Signals 
in  Noise 


>' 

A 

7.1  INTRODUCTION  j 

It  is  important  to  recognize  the  practical  limitations  of  the  theory  developed  this  | 

far.  Certainly,  a deterministic  analysis  is  possible  for  simple  differential  systems,  ; i 

where  all  the  descriptors  satisfying  the  input-output-state  relations  are  known.  j 

However,  for  large  complex  systems  all  the  state  variables,  the  elements  of  the  i 

fundamental  matrix,  etc.,  are  not  known  nor  can  they  even  be  defined  in  some  j 

cases.  The  theory  addresses  a small  part  of  the  real  (nonlinear)  world.  Conse-  I 

quently,  a deterministic  analysis  of  many  real  systems  is  either  not  a simple  task  : | 

or  not  feasible  using  the  linear  theory  developed  thus  far.  ' | 

As  a method  for  augmenting  the  theory  we  turn  to  statistical  or  proba-  i 

bilistic  analysis.  The  input-output  relations  are  treated  as  random  processes.  We  , i 

will  look  at  the  statistical  properties  of  each  process,  where  the  relations  be-  j 

tween  these  properties  will  form  the  basis  of  the  analysis.  The  most  important  of 

these  are  the  mean  value  and  correlation  function.  Our  attention  will  specifically  j 

focus  on  the  filtering  problem  and  stochastic  processes,  i.e.,  on  signals  u(t)  that  ] 
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are  described  by  their  averages  rather  than  signals  u(t)  that  are  described  by  tlieir 
point  properties.  Appendix  F contains  a short  summary  of  some  of  the  proba- 
bilistic expressions  which  will  be  used  in  the  sequel. 


7.2  AVERAGES  (MEAN  VALUE)  AND  CORRELATION  FUNCTION 

As  a prelude  to  the  discussion  to  follow  we  consider  two  different  ways  to 
represent  an  average.  They  are  the  time  average  and  ensemble  average.  Both  lead 
to  the  same  results.  The  time  average  <u(f)>  of  the  signal  u(t)  is  defined  as 


iuit))  = 


lim 

r-- 


J_ 

2T 


f 


i/(t)dt 


(7.1) 


where  T is  the  time  interval  This  limit  is  a number  associated  with  the 

functional  u{t).  On  the  other  hand  the  average  of  a random  signal  is  interpreted 
differently.  Assume  that  the  set  {ti(f)}  is  a stochastic  process  as  defined  in 
Appendix  F.  For  a given  time  t,  the  sample  function  u(t)  is  a random  process 
resulting  in  the  random  variable  f.  The  expected  value  of  the  random  variable 
will  be  denoted  as  /f  {«(/)};  it  is  the  ensemble  average  of  the  random  process 
{u(/)}.  We  have 


(7.2) 


= l(t) 


(7.3) 


J af(aj)dt  (7.4) 


where  f{aj)  is  the  probability  density  function  associated  with  the  value  a of 
the  random  variable,  and  f(a,t)cla  is  the  probability  that  a will  be  found  in 
a ± da.  \n  conjunction  with  the  above  it  follows  that 


'{a\t)da  = 1 (7.5) 


In  general  <«(/)>  and  A'{i/(r)}  are  unrelated.  (i<(/)>  is  a constant,  whereas 
l^iiLU)}  depends  on  t.  However,  if  the  process  is  stationary'  (see  Appendi.x  F) 


I 


■-•m. 
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and  satisfies  certain  ergodic  conditions,  then  the  time  average  iu(t))  performed 
on  each  member  of  the  set  is  almost  certainly  a constant  equal  to  /•,'{«(/)},  i.e.. 

<«(/)>  = A'{m(/)}  (7.6) 

The  correlation  function  R(%)  for  the  process  i±{t)  is  defined  by  (F.8)  as 


«(«)  = (7.7) 

It  is  assumed  that  u(t)  is  a stationary  stochastic  process  where  u_{t)  = 0.  Expres- 
sion (7.7)  characterizes  the  statistical  relationship  between  the  values  of  «(r)at 
times  t and  r-  where  | can  be  positive  or  negative.  It  is  tacitly  implied  that  as 
i increases  the  statistical  coupling  between  u(t)  and  u(t  - ?)  becomes  weaker. 

Since  u(t)  is  stationary  the  mean  values  u[t),  u}(t),  u{j  - ?)«(/).  etc.,  are 
independent  of  / and  depend  only  on  Therefore, 


u(t-^)uU)  = u(-m0)  = «(0)«(|) 

(7.8) 

««)  = /?(-?) 

(7.9) 

For  two  processes,  say  u{t)  and£(r),  the  cross  correlation  function  /?„,,(?) 
is  defined  as 


R,n\^)  = (710) 

Expression  (7.10)  is  unlike  (7.7)  in  the  sense  that  it  is  not  an  even  function: 

«,„■(?)  = «vu(-S)  (7.11) 


where 


«y«(-?)  = v(f)«(f  - ?)  = - ?M0  (717) 

In  a manner  similar  to  defining  the  average  or  mean  value  two  different 
ways  we  examine  the  time  average  alternative  to  (7.7).  The  process  u{t)  is  said  to 
be  ergodic  if  all  its  statistics  can  be  determined  from  a single  sample  function  of 
u{t).  Consequently,  for  ergodic  processes  the  (time  average)  correlation  function 
R(%)  is  defined  as,  (see  (F.l  7)), 


/?(?)  = 


(7.13) 


■■iiii 


4^^  • "V  >• 


.A. 
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(7.14) 

(7.15) 


7.3  FILTERING  PROBLEM 

The  problem  of  filtering  can  be  stated  as  follows:  Let  the  time  dependent  input, 
applied  at  the  system  (filter)  input  terminal,  be  represented  as  the  random 
process  u(t)  (see  Figure  7.1).  It  is  the  sum  of  a useful  signal _s(r)  and  noise  n(r), 
both  of  which  are  random  processes; 

u{t)  = s(t)  + n(t)  (7.16) 

The  system  //  operates  on  the  input  resulting  in  the  output  j^t).  We  have 

m = M[u(t)\ 

- + "(01  (7.17) 

The  problem  is  to  choose  //  such  that  it  reproduces  some  meaningful  function, 
say  m(t).  with  least  possible  error.  The  function  m(t)  is  the  transformed  version 
of  the  signal  ^(/): 


m{t)  = H[gt)]  (7.18) 

In  simple  filtering  H is  the  unit  operator  1 , in  which  case  (7.18)  reduces  to 

«(0  = 1(0  (7.1*^) 

The  instantaneous  error  of  reproduction  e(r)  is  the  difference 

e(r)  = 1(0  “ m(t)  (7.20) 

= 7fll(0  + "(0]  * >n(t)  (7.21) 


«(/) 


\ .. 

IJ 

/ p 

11 

^.4 


Figure  7.1 
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Clearly,  e(r)  is  also  a ranclojii  process.  The  intensity  of  the  fluctuations  in  the 
process  is  characterized  by  their  mean  square  value 


/f{e2(r)}  = eHt) 


(7.22) 


In  the  absence  of  noise,  where  n(t)  = 0.  our  interest  will  be  in  predicting 
tlie  value  of  s(r)  after  a time  interval 

m(t)  = H[sU+^)\  (7.23) 

The  prediction  is  made  by  considering  past  behavior  of  ^(r)  and  exploiting  the 
statistical  properties  of  the  random  process. 

The  discussion  to  follow  will  focus  exclusively  on  linear  filters.  For  such 
filters  the  input-output  relations  can  be  written  as 

y(t)  = H\u(t)\ 


=/; 


h{^)u(t-^)d^ 


f 


hit  - ?)u(?)c/? 


(7.24) 


(7.25) 


where  h(t)  is  the  weighting  function  or  impulse  response  of  //.  Filters  charac- 
terized by  either  (7.24)  or  (7.25)  are  time  invariant  or  stationary  filters.  The 
analysis  will  be  noise  suppression  by  linear  filtering.  Specifically,  the  filter  will 
be  the  type  where  u(t)  is  stored  for  a time  J then  processed.  It  appears  at  the 
output  as  v<r).  (Such  systems  resemble  computers  which  store  input  data, 
process  it,  then  deliver  the  processed  data  to  an  output  terminal.)  To  fomi^fr) 
the  val^s  u(|)  for  all  | are  used.  We  seek  the  best  system,  characterized  by  //. 
where  is  a minimum. 

As  a matter  of  convenience  we  will  use  u to  mean  u and  similarly  for  all 
other  quantities  of  the  process.  Wliere  clarification  is  required  the  context  will  so 
state. 


I 


7.4  OPTIMUM  LINEAR  FILTER 

The  accuracy  with  which  the  useful  signal  is  reproduced  is  determined  by  the 
mean  square  error  e^,  where  e is  defined  by  (7.20).  The  filter  //  which  minimizes 
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the  mean  square  error,  wlierein  the  output  reproduces  the  meaningful  mU).  is 
called  an  optimum  filter.  To  determine  II  we  first  establish  from  (7.24)  the 
input-output  relationship  for  the  optimum  filter: 

v(f)  = r iHiMt  - 

oo 

Tlie  corresponding  error  function  is,  trom  (7.20), 

e{t)  = r - m(t)  (7.26) 


Squaring  (7.26)  gives 


e^t) 


ha)u(t  - |)r/? 


2mU)  \ h(l)u(t  ~ ^)d^  + mHt) 


h{^)h{%')u(t  - ^)uU  - i')d^d^' 


hQ)m(t)ii(t  - ^)d^  + m^{t) 


(7.27) 


where  the  square  of  the  first  term  on  the  right-hand  side  was  transformed  into  a 
double  integral.  Forming  the  mean  of  (7.27)  we  have 


e^U) 


■II 


ha)h(^')u{t-^)u(t-^')d^d^' 


/i(J)m(r)u(/ - |)t/|  + m\t) 


(7.28) 


From  definitions  (7.7)  and  (7.10)  we  introduce  the  autocorrelation  function 
/?„(?)  and  the  cross-correlation  function  Rmui^)-  where 


K„(^)  = i/(r)i/(r-  ?) 


(7.29) 


*ik. 
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= K(l  (7.30) 

= inU)tiU-  (7.31) 

Wf  note  that  the  autocorrelation  function 

7<m(?)  = "'(0«i(t  - ?)  (7.32) 

for  J = 0 reduces  to 

«,„(0)  = m^U)  (7.33) 

Thus,  tlie  mean  square  error  for  the  optimum  filter  can  be  written  as 

? = JJ  - :j  ha)f<mu(i)d^  + 7?,„(0)  (7.34) 

Equation  (7.34)  clearly  shows  the  dependency  of  the  mean  square  error  on  the 
impulse  response  functions  and  the  correlation  functions.  It  can  be  shown  that 
has  its  minimum  value  if  and  only  if  the  impulse  response  h{t)  is  a solution  to 
the  equation 

I /i(|')7?»(l  -l)d^  = (?»,»(?)  (7.35) 

*"  —Go 

Thus,  it  now  remains  to  solve  (7.35)  for  /i(|). 

The  solution  to  (7. .75)  will  be  more  convenient  with  the  following  defini- 
tions. Given  the  correlation  function  R(^)  we  introduce  its  Fourier  translorm 
S(cj).  where 

5(w)  = ,‘f  {/?(?)} 

= I e-'^^R(^)d%  (7.36) 

— oo 

Inverting  (7.36)  gives 

«(|)  = ^ / (7.37) 

— oo 
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Hie  fimction  i'(co)  is  tlie  power  spectral  density,  the  meaning  of  whiclt  will 
become  clearer  later.  Similarly,  we  introduce  the  transform  of  the  impulse  re- 
sponse /;(/): 

//(to)  = ‘■.Him) 


= J e-i'^^h(k)d% 


Its  inverse  is 


*(?)  ^ J e‘'^'^n(u3)doi  (7.39) 

Tlie  function  //(to)  is  called  the  system  transfer  function . We  can  now  begin  to 
solve  (7.35)  for /i(to). 

Multiplying  both  sides  of  (7. 32)  by  and  integrating  with  respect  to  ^ 

gives 

j /!(?')/?„(? -?V|'  = J (7-40) 

Letting  the  variable  | - |'  = / the  left-hand  side  of  (7.40)  becomes 

r /r(?')/?„(?-?')^f?'=  f e-f^'ha')d^'j'  e->'^'R^(t)dt 

“-00  _oo 

= //(to)5„(to) 

whereas  the  riglit-hand  side  is  equal  to  Sf„u(u).  Equation  (7.40)  therefore 
becomes 

//(co)5,,(co)  - S,^i/(oj) 

Thus,  the  transfer  function  of  the  optimum  filter  is  simply 

...  . ■^mu(‘*>)  ,-7 /in 


i.e..  it  is  the  ratio  of  the  spectral  densities  5„,„(to)  and  S„(oj). 


J 
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Substituting  equation  (7.35)  into  (7.34)  the  mean  square  error  for  the 
optimum  filter  becomes 

? = R„,{Q)  - JJ  ^')dm'  (7.42) 


In  view  of  transforms  (7.36)  through  (7.39)  we  can  write 


7?/n(0)  ^ 
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We  now  apply  the  formulation  to  the  problem  of  simple  filtering  where 
m{t)  = s{t) 

H = 1 

We  assume,  for  simplicity,  there  is  no  cross-correlation  between  the  useful  signal 
and  noise,  i.e.. 


- 1)  = 0 

The  correlation  functions  and  /?„  can  be  written  as 

^su(^)  = 0 = s{t)s{t  - I)  + s(t)n{t  - ?)  = 

^u(S)  = (s(f)  + «(f)]  [s(f-S)  + «(f-?)l  = /?s(^)  + «„(|) 
Consequently, 


^u(^)  ~ ^s(^)  ^n(^) 

and,  from  (7.41 ),  the  transfer  function  //(to)  is 

5j(to) 

The  corresponding  mean  square  error  becomes,  by  equation  (7.43), 


(7.44) 


2 _ 1 r ■S'^(co)5ff(u)) 

27t  J ^ ■Sj(w)  + 5„(co)  ^ 


(7.45) 


In  the  case  where  the  spectral  densities  and  5„  do  not  overlap  (Fig. 
7.2a)  we  have,  from  equation  (7.44), 

//(to)  = 1,  5j(w)  0 

//(to)  = 0,  5j(co)  = 0 

From  formula  (7.45) 

e"2  = 0 
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Figure  7.2  Filtering  of  random  processes. 


Hence,  simple  filtering,  where  the  useful  signal  and  the  noise  do  not  occupy  the 
same  frequency  band,  takes  place  without  error.  However,  if  and  S„  overlap 
(Fig.  7.2b)  then  error  accompanies  the  filtering.  The  error  is,  in  part,  due  to  the 
noise  in  the  frequency  range  coj  < co  < W2  passing  through  the  filter,  and,  in 
part,  due  to  the  distortion  of  the  signal  resulting  from  attenuation  in  the  fre- 
quency range  co]  < co  < co2-  As  5„  gets  larger  and  gets  smaller  in  the  range 
W|  <w<w2  the  less  this  range  will  be  allowed  to  pass  througli  the  optimum 
filter. 

We  next  examine  the  interesting  case  where 


The  spectral  density  of  the  noise  is  much  larger  than  that  of  the  useful  signal.  In 
this  case  the  transfer  function  is  approximated,  from  equation  (7,44),  as 


//(w)  = 


5„(w) 


The  corresponding  mean  square  error  is,  from  (7.45), 


e2 


2?r 


(w)c/cj  = /?j(0) 


= sV) 

Thus,  when  the  intensity  of  the  input  noise  is  much  greater  than  the  useful  signal 
and  occupies  the  same  frequency  range  then  the  mean  square  error  at  the  output 
of  the  optimum  filter  is  equal  to  the  mean  square  of  the  useful  signal.  This  is  to 
say  that  when  the  input  noise  is  intense  the  intensity  of  ifr)  at  the  output  is 
weak.  In  fact,  we  have,  approximately. 


y(r)  = 0 
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e(0  = s(/) 

since  the  magnitude  of  //  is  much  less  than  unity. 

7.5  SPECTRAL  DENSITY  OF  SIGNAL  AND  NOISE 

We  assumed  in  tlie  simple  example  of  Section  7.4  that  RsrM)  ~ 0.  However,  in 
practice  and,  therefore,  Ss„(w)  is  finite.  Accordingly,  one  can  ask 

what  is  the  physical  significance  of  From  (7.36)  the  spectral  density 

Sj„(to)  corresponding  to  the  cross<orrelation  function  RsnU) 

S„,(to)  = S„j(-cu)  (7.46) 

5.„(-w)  = S*(co)  (7.47) 

'■  For  the  process  «(r)  we  can  write 

R„(%)  = ls(f)+n(f)lls(f-?)  + n(f-S)l 

= RsiX)  + «n(?)  + 7?j„(?)  + Rnsii) 

Therefore 

; S„(S)  = 5s(w)  + S„(cu)  + Ss„(co)  + S„s(o;) 

I = 5j(w)  + S„(tj)  + 2/?e[Sj„(w)) 

f 

Thus,  the  spectral  density  of  the  input  process  consists  of  that  which  is  signal, 
that  which  is  noise,  and  an  added  real  part  2/?e(5j„(to)l  called  the  interference 
1 intensity  caused  by  the  statistical  coupling  between  s(t)  and  n(t).  The  imaginary 

I part  of  Ssn(cu),  which  identifies  the  phase  of  the  statistical  relationship  between 

1 s(t)  and  «(f ),  has  no  explicit  physical  meaning. 

i 7.6  POWER  SPECTRAL  DENSITY  AND  CORRELATION  FUNCTION 

! ■ Consider  the  stationary  stochastic  process  of  Appendix  F.  Let  the  process  be 

ergodic  where  the  sample  function  u{t)  contains  all  the  characteristics  of  the 
process.  For  real  u(t)  with  Fourier  transform  (/(w)  the  energy  E of  u(t)  is 


i 
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The  quantity  |t/(a>)P  is  the  energy  density  spectrum  whereas  u^(t)  is  the  instan- 
taneous power  associated  with  u(t).  The  average  power  P of  u{t)  is 


I 

</>>  = />=  lim  u\t)dt 


It  is  possible  from  (7.48)  and  (7.49)  that  the  energy  of  u(t)  can  be  infinite  and 
die  average  power  finite.  If  the  energy  of  u(t)  is  not  finite  it  may  not  have  a 
Fourier  transform.  Therefore  it  is  convenient  to  define  the  truncated  signal 
ur(t): 


u(t)  Itl  < T 

0 |/|  > r 


The  corresponding  Fourier  transform  Uj{oS)  is 


t/7-(tj) 


= /} 


T{t)e  = I u{t)e  '^‘dt 


Thus,  as  T"-*-  the  truncated  sample  function  Uf{t)  approaches  «(/).  The  aver- 
age power  Pj  of  «(/)  over  the  interval  (-T,T)  may  now  be  written  as 


(Pf)  - Pj  - 


Ht)dt  = 


= iU(u)\^/2Tdu 


The  quantity  \U(u)\^/2T  is  the  power  spectral  density  of  u(t)  in  the  interval 
i-T,T). 

In  dealing  with  a random  process  that  is  not  ergodic,  we  may  associate 
with  each  sample  function  u(t)  a truncated  sample  function  UT-(f).  its  Fourier 
transform  Uj(u))  and  its  spectral  density.  The  power  spectral  density  IV(w)  of 
the  random  process  is  defined  as 
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= Urn 

= lim  (7.52) 

7’— ^ ‘ 

i.e.,  it  is  the  limit  of  the  ensemble  average  of  the  power  spectral  densities  of  all 
of  its  truncated  sample  functions.  Accordingly  Uj(co)  is  a random  variable. 

Equation  (7.52)  is  not  in  convenient  form  to  calculate  power  spectral 
densities.  A more  convenient  form  relates  power  spectral  density  to  the  correla- 
tion function.  For  stationary  processes  IV(cj)  is  the  Fourier  transform  of  the 
correlation  function.  We  see  this  by  substituting  the  definition  of  Ujiu)  in 
(7.52),  which  gives 


W(cj)  = lim  :^f/r(w)t/f(w) 

7*— ► oo  ^ * 


The  ensemble  averaging  of  u(ti)  and  u(t2)  is 


u(ti)u(t2)  = RuUl^z) 


u(t-  t\)u(t- t2)  = 7?„{fi  - t2) 
Substituting  (7.54)  in  (7.53)  we  have 

T T 

W(co)=  Um^J  dfij  dt2Ruit\-t2)e~’'^^'''''^^ 


(7.53) 


(7.54) 


(7.55) 


The  double  integral  can  be  written  as  a single  integral  with  the  simple  change  of 
variables. 


k = ti  - t2  = f\  + (2 

d%  = dt\  dk'  = dt2 
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Accordingly,  tor  any  function  g(t\-  ii)  one  can  write 


i'T  , 

.27 

.27-1^1 

dtl 

1 dtzgUi  - ti)  = y 

«(?)c/r 

.7, 

-27  * 

-27+IJi 

= (27-- 

‘'-27- 

Using  this  expression  in  (7.55)  we  get 


(7.56) 

Thus,  for  stationary  random  processes  the  power  spectral  density  is  the  Fourier 
transform  of  the  correlation  function. 

In  comparing  (7.56)  with  (7.36)  it  is  seen  that  from  the  definition  of  Sfco) 
we  have 


h'(co) 


J-f 

27’J 


2T 


(2T- 


.27 

= liin  I ( 1 


S(u>)  = ft'(u)) 

The  inverse  of  (7.56)  immediately  gives 


«„(^)  = 


(7.57) 


(7.58) 


1 


Relationships  (7.56)  and  (7.58),  between  the  power  spectral  density  and  the 
correlation  function,  are  known  as  the  Wiener- Khintchine  equations. 


.M 


J 
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Using  (7.36)  and  (7.37)  we  consider  a few  sample  functions  of  R{^)  and  corre- 
sponding .S’(co): 


7?(?) 


5(oj) 


(a)  /?(0)e-“'^' 

7?(0) 


laRjQ) 
0?  -E  IaP- 


(b) 


(c) 


7t(?(0) 


1 - 

R(fi)  sin  tool 


u>/t 


7T/?(0) 


COq  < to  < top 


wo 

0 otherwise 


where  a is  a real  parameter.  In  each  case  the  functions  R(^)  and  S(oj)  assume 
tlieir  ma.ximum  values  when  ^ = 0 and  w = 0.  We  (loosely)  define  the  correlation 
time  A|  as  the  time  interval  -A|  < ^ < A|  within  which  the  value  of  R(^)  is  of 
die  same  order  of  magnitude  as  R(0).  Outside  this  time  interval  R(^)  is  much  less 
than  7?(0).  For  the  examples  chosen  A|  is  approximately  equal  to  1/a,  i.e., 
A|  ~ 1/a.  Similarly,  we  define  the  spectral  bandwidth  Aw  as  the  frequency 
interval  -Aw  < w < Aw  within  which  5(w)  compares  to  5(0).  Outside  this 
interval  5(w)  is  much  less  than  5(0).  For  the  examples  chosen  Aw~a.  Hence 
the  correlation  time-bandwidth  product 


A|Aw 


(7.59) 


By  introducing  a more  exact  definition  of  A?  and  Aw  we  can  derive  a 
correspondingly  more  precise  expression  for  AJAw.  Let  die  spectral  bandwidth 
Aw  be  defined  as 


I 5(w)t/w  = 2 I ,5(w)(/w  = 25(0)Aw 
•'-»  -^0 


(7.60) 


Definition  (7.60)  specifies  Aw  such  that  the  curve  .S'(w)  can  be  approximated  by 
a rectangle  of  height  5(0)  and  width  2Aw  as  shown  in  Fig.  7.3).  The  area  of  the 
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rectangle  is  equal  to  the  area  under  the  curve  S(co).  Similarly,  the  correlation 
time  is  defined  as 

R(^)d^  = 2/?(0)A|  (7.61) 

Using 


5(0)  = r /?(|)c/|  «(0)  = ^ [ 5(w)c/w 

the  exact  relation  for  the  correlation  time-bandwidth  product  is 

A|A(o  = y (7.62) 

Equations  (7.59)  and  (7.62)  apply  to  any  pair  of  functions  which  are  the 
Fourier  transform  of  each  other.  Of  particular  interest  is  their  application  to  the 
impulse  response  /i(|)  and  transfer  function  /7(co).  The  quantity  Aco  defines  the 
pass-hand  of  the  system.  A|  defines  the  correlation  time  or  memory  of  the 
system.  The  system  memory  is  taken  to  be  the  time  interval  during  which  the 
input  appreciably  influences  the  output.  It  symbolizes  the  time  during  which  the 
system  responds  to  a unit  impulse. 


7.8  TRANSFER  FUNCTIONS  AND  LINEAR  OPERATORS 

The  relationship  between  s(t)  and  m(t)  was  specified  by  (7.18)  as 

m(t)  = //[s(t)l 


. ' 

--vw  ’C**.*-  “ 
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where  //  is  a linear  operator.  The  Fourier  transform  of  the  impulse  response  (c.l. 
eq.  (7.38)1  established  the  transfer  function  //(cj)  for  the  optimum  filter  as  |c.f. 
eq.  (7.41)1 


//(CO)  = 


5„(co) 


Equation  (7.41 ) is  the  direct  result  of  the  input-output  relation 


= / 


>•(/)=  /!(?)«(/ -?)C/? 


We  next  investigate  the  relationship  of  //  and  //(co).  From  (7.18)  the 
integral  form  of  the  operator  equation  is 


m(t)  = j h(t')s(t  - t')dt' 


(7.63) 


Previously  we  had 


— oo 

= J"  e~'^m(t)u(t  - (7.64) 


Substituting  m(t)  from  (7.63)  into  (7.64)  gives 


= J e-^‘^fc/?j'  h{t')RsuiX- (7.65) 
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By  changing  the  variable  of  integration,  such  that  ? = / + /'.equation  (7.65)  can 
be  written  as 


„(w)  = I*  e 


= //(cj)5su(tj)  (l.Mi) 

Thus  the  transfer  function  as  derived  from  the  operator  relation  (7,63)  is  the 
ratio  of  the  spectral  densities  and  .9j„(co).  where  we  have  specified  the 

general  form  of  //(co)  as 


//(w) 


r 


(7.67) 


Obviously,  not  all  linear  operators  can  be  written  in  the  form  of  (7.6.3).  In 
these  cases  the  transfer  function  //(co)  defined  to  be 


//(w) 


^sui^) 


A few  simple  examples  illustrating  this  point  are  as  follows:  For  simple  filtering 
H is  the  unit  operator,  i.e.. 


m(t)  = //li(r)l  = s(t) 

and  Smu(<^)  ~ 5su(cJ).  Thus  the  transfer  function 

//(w)  = 1 


For  time-shift  operators  where 


m(t)  = s(t  + A) 


it  is  readily  shown  that 


H(oj)  = 


The  differentiation  operator 
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gives 


//(cj)  = /co 


whereas  the  integral  operator 


=i 


mil)  = dl  {s(/)} 
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iij' 


{5(0} 


gives 


//(w)  = — 

/CJ 

Kach  of  the  above  identities  can  be  proven  by  forming  S„,u(u>)  and  .9ju(u))and 
applying  formula  (7.66). 

Generalizing  the  above  equations  it  can  be  said  that  if  the  transfer  function 
//(w)  is  a polynomial 


//(CO)  = E C(,(/to)‘' 
v=o 


(7.68) 


where  the  coefricients  Q are  constants,  then  the  corresponding  operator  can  be 
written  as 


//  = i:  Q (j 

5=0 


(7.69) 


Formula  (7.69)  applies  only  when  (7.66)  can  be  written  as  the  ratio  of  two 
polynomials 


//(CO)  = 


/'ft  (CO) 


(7.70) 


where  /^(co)  and  /'/,(co)  are  polynomials  of  degree  a and  h.  respectively,  and 
a>  h.  If  < h the  numerator  is  divided  by  the  denominator  giving 


//(co)  = //qfco)  + //i(co) 


(7.71 ; 


— . V*  ■ 1 
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where  » 

b~a 

//oM  = 

li=0 

and  tlie  remainder 


//,(w)  = J 


(7.72) 


(7.73) 


7.9  MONOCHROMATIC  SIGNALS 

The  classical  form  of  an  amplitude  modulated  signal  generated  by  an  harmonic 
oscillator  is 

s(t)  = a cos  u>or  + b sin  coq/ 

= e cos(coo^  (7.74) 

where 


a = e cos  ^ b = esin^p 


Let  the  modulation  parameters  a and  b,  or  e and  <p,  be  random  variables  which 
are  independent  of  time  and  satisfy  the  conditions 

a = b = 0 = b^  ah  = 0 (7.75) 

Then  waveform  (7.74)  can  be  considered  as  a random  process.  In  order  for  this 
process  to  qualify  as  a quasi-monochromatic  signal  the  bandwidth  Aco  of  the 
modulation  must  be  very  small  compared  to  the  carrier  frequency  wq- 

Aw  < Wo  (7.76) 

In  accordance  with  conditions  (7.75)  the  stationary  random  functions  a(t)  and 
bU)  have  the  following  statistical  properties 

W)  = W)  = Q (7.77) 

a(/)a(r  - ?)  = b{t)b(t-%)  = cM%)  (7.78) 
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= 0 (7.79) 

where  c“r(J)  is  the  autocorrelation  function  for  processes  a{t)  and  h(i)-  By 
property  (7.79)  there  is  no  cross-correlation  between  the  two  processes.  The 
constant  c~  is  determined  by 


= a~  = (7.80) 

i.e.,  it  is  the  mean  square  value  of  a and  h,  so  that 

r(0)  = I (7.81) 

Correlation  functions  satisfying  conditions  (7.81 ) are  said  to  be  normalized. 
Calculating  the  correlation  function  for  s(t)  we  have 

^s(?)  = [a(/)coscoof  + 6(Osincuo^][a(f-  ?)coswo(f  ~ |)sinwo(?  ‘ ?)1 

= c^r(f)[coscjot‘coswo(f  - ?)  + sinwof  sincjo(^  " ?)l 

= c^r(^)coscjol  (7.82) 

On  normalizing  we  have  for  ^ = 0 

RAO)  = (7.83) 

The  spectral  density  for  the  normalized  correlation  function  r(^)  is 

s(w)  = f e-'^ra)di  (7.84) 

— oo 

Its  corresponding  inverse  becomes 

/■(?)  = ^ f e'^(s(oj)dcj  (7.85) 

— oo 

Thus  the  spectral  density  5j(cj)  of  the  quasi-monochromatic  signal  s(r)  can  be 
written  as 


I 

t Ss(co) 

I 


I 


e-i^^RA^)d% 


e '^^r{^)cosu>o^d^ 


J 
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= — (4(0;  ~ wq)  + s(uJ  + uJo)l  (7.86) 

We  assume  that  s(w)  is  bell-shaped  with  its  maximum  at  co  = 0.  Therefore  from 
(7.86)  and  (7.76)  i’jfco)  represents  two  non-overlapping  bell-shaped  curves 
centered  approximately  at  -cjq  and  +coo.  From  (7.62)  where 

A^Aco  = y 

inequality  (7.76)  implies  that 

cjoA?  > 1 (7.87) 


I 

I 

I 

I 


Since  A?  defines  the  variation  with  time  of  the  random  functions  a(r)  and  h(r) 
condition  (7.87)  specifies  that  cos  ojor  and  sin  ojQf  undergo  many  oscillations  in 
time  before  a(t)  and  b(r)  change  appreciably. 

For  the  ensuing  calculations  we  choose  the  form  of  the  normalized  correla- 
tion function  as 


r(?)  = (7.88) 

where  a determines  the  spectral  bandwidth.  The  corresponding  Fourier  trans- 
form gives 


s(to)  = 


2a 
a?-  + 


(7.89) 


From  (7.8 1 ) and  (7.85)  we  have 


— J s(u>)dui  = 1 

which  when  combined  with  (7.60)  gives 

Aw  = 


(7.90) 


STATISTICAL  SYSTEMS-SIGNALS  IN  NOISE 


145 


We  now  examine  the  optimum  filter  which  separates  the  quasi- 
monochromatic  signal  from  so-called  “white  noise,”  i.e.,  noise  of  constant 
spectral  density.  We  have 

,S’„(w)  = -S’„  = const.  (7.91) 

For  simple  filtering  where  there  is  no  correlation  between  signal  and  noise,  i.e., 
where 

s(f)n(r-  ?)  = 0 

the  transfer  function  is,  by  (7.44), 

Sj(co)  + S„(co) 


and 


h(i)  = — I ^ <'^•‘^2) 

27T  j _ 1 -I- (6'„(co)A9s(w)l 

Our  solution  to  (7.92)  will  be  confined  to  positive  values  of  t since  /i(|)  is  an 
even  function.  Representing  the  denominator  of  (7.92)  as  D{u>)  we  look  for  the 
roots  of  the  equation  D(co)  = 0.  For  co  ~ coq  we  have  approximately,  from 
(7.86). 


^ c‘-a 

S,(u>)  ~ ^s(eo“UJo)  ~ , ,2  j.  0,2 

2 (oi-too) 

where  the  term  .i(co  - co„)  ~ s(2coo)  is  neglected.  Thus,  the  denominator 
D(w)  = I + S„ 


(co  - cOq)7  + 


has  the  roots 


CO  = u)Q  4 fO 


where 

0 = n/q:7  + (c^alS„) 
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Similarly,  two  other  zeros  of  /)(w)  can  be  found  for  At  the  poles 

w = icoo  + jO  of  the  integrand  in  (7.92)  the  derivative  dDjdu)  is 

dP  ^ 

d<jj  ^ c^a 


where  we  have  asserted  that  only  the  poles  in  the  upper  half-plane  are  important. 
The  contour  of  integration  is  dosed  in  the  upper  half-plane  and  the  integral  is 
reduced  to  a sum  of  residues.  Tire  final  expression  for  //(?)  becomes 

a- 

//(?)  = -^e~"'^'cosu)of  (7.93) 


where 


^ jj  s\t ) 

aS„  2 .9„Aco 


(7.94) 


The  dimensionless  parameter  p represents  the  signal- to-noise  power  ratio.  It 
depicts  the  extent  to  which  the  signal  is  stronger  (or  weaker)  than  the  noise. 
From  the  above  the  parameter  0 can  be  written  in  terms  of  a as 

0 = asj\  + p (7.95) 

According  to  (7.83)  represents  the  average  power  of  the  quasi-monochormatic 
signal,  and  by  (7.90)  ,S'„a  is  the  noise  power  in  the  band  occupied  by  the  signal. 
For  small  signal-to-noise  ratio  p <€  I . Therefore  equation  (7.95)  implies 


0 = a 


Hence,  from  (7.93).  (7.94)  and  (7.88)  the  impulse  response  is 
('(?)  = ape‘“'^'cos  W()? 

Rs(^) 


(7.96) 


and  fron  (7.44)  the  transfer  function  reduces  to 

5j(oj) 

//(CO)  = 


(7.97) 


which  is  in  agreement  with  our  discussion  in  Section  7.4. 
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The  pass-band  established  by  llie  roots  of  /)(w)  include  the  frequency 
intervals  too  - and  ~coo  i i^to  where,  from  (7.90)  and  (7.9d),  the  spectral 
bandwidth  ilto  is  determined  by  the  parameter  d : 


Au)  = ~ 0 


-ayjl+p 


(7.98) 


We  see  from  (7.98)  that  as  the  noise  power  S„  decreases  the  pass-band  Ato 
increases.  This  leads  to  a shortening  of  the  (memory)  time  I/O  required  for  the 
signal  to  be  extracted  from  the  noise,  which  agrees  with  our  earlier  conclusion 
that  for  simple  filtering  in  the  absence  of  noise  no  time  is  required  to  duplicate 
tlie  signal.  However,  the  extraction  of  signals  in  the  presence  of  noise  must  be 
done  in  a time  less  than  the  correlation  time  of  the  signal,  in  which  case  we 
always  have  0 > a. 

The  mean-square  filtering  error  for  the  quasi-monochromatic  signal  is 


(co)dco 


= 


if 


Ss  (w) 


5j(w)  + 5„(w) 


if 


Ss(oj) 


+ [S„ISAoj)\ 


(Jeo 


(7.99) 


The  integral  of  (7.99)  is  evaluated  by  calculating  the  residues  associated  with  the 
zeros  of  the  denominator  (w  = ±cof)  +j0)  and  the  residues  associated  with  the 
poles  of  the  numerator  (w  = icoq  + /a).  Equation  (7.99)  becomes 

-J  _ SsiooQ  +j0)  + ■^.t(-CCQ  +j0) 

^ " 2S„  Olc^a 


(7.100) 


vT+p 


die 


Consider  the  situation  where  the  signal  is  “highly"  monochromatic,  i.e.. 
parameter  a approaches  zero.  Therefore,  by  (7.94)  p then  increases  and  by 
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(7.100)  €“  decreases,  htteclively.  the  iilier  pass-band  Aco  becomes  narrower  and 
its  correlation  time  A/ ^ \/0  becomes  larger  tliereby  making  more  effective  use 
of  the  signal.  l or  strictly  monochromatic  signals  a = 0 = cj  = 0 and  = 0.  In 
this  case  the  intinite  working  time  of  the  filter  produces  an  infinitely  small  (or 
zero)  error.  Clearly,  tor  a finite  working  time  the  error  is  greater  than  zero. 


7.10  SIGNALS  OF  KNOWN  FORM  IN  NOISE 

In  our  previous  discussions  we  treated  the  signal  and  noise  as  random  processes. 
In  the  disciplines  of  radar  and  radio  communications  the  signal  is  usually  of 
known  form  and  is  not  regarded  as  a random  process.  Instead,  it  can  be  regarded 
as  a known  function  with  several  unknown  parameters,  such  as  amplitude  or 
phase.  In  such  cases  filtering  must,  (a)  make  the  most  reliable  observation  of  the 
useful  signal,  and  (b)  make  the  most  precise  measurement  of  the  signal's  un- 
known parameters.  Consequently,  a performance  metric  of  a filter  extracting 
signals  from  noise  can  be  the  signal-to-noise  ratio  at  the  output. 

Assuming  the  signal  s(t)  has  a well-defined  form  we  have  from  (7.16) 

u(f)  = s{t)  + nit) 

where  ii(t)  is  a random  process.  The  corresponding  output  is 
y(t)  = H\u(t)\ 

= ll\s(t)\  + ll\nU)\ 

= aU)  -t  nU)  (7.101 ) 

where  o(r)  and  rj(r)  are  the  results  of  passing  the  useful  signal  and  noise,  respec- 
tively. througli  the  filter.  For  s(t)  we  can  write  its  Fourier  transform  and  inverse, 
respectively,  as 


■ if 


c ^“'.v(r)(/r 


e^“'.S'(co)r/w 


r 
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a{t) 
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(7.102) 


Treating  the  noise  as  a random  process  we  can  write  for  n^(t)  [c.f.  equation 
(7.48)1 , the  average  noise  power  at  the  input. 


i / 


n'^U)  = I S„(cj)c/u) 

The  noise  at  the  output  of  the  filter  becomes 

= ^ / |//(w)|2<(„(w)c/co 

•/~oe 

Accordingly,  we  define  the  signal-to-noise  power  ratio  p as 

p = - 


(7.103) 


(7.104) 


(7.105) 


where  <r(^o)  is  the  signal  value  at  a specified  time  tp-  from  (7.102)and  (7.104) 
tire  signal-to-noise  power  ratio  can  be  written  as 


*^-oo 


0})S(cj)dcj 


in 


(7.106) 


J lf/(u>)I^S„(cj)du> 


We  now  look  for  a filter  which  gives  the  largest  value  ofp.  Our  analytical 
procedure  will  be  to  use  the  relation 


,v(/o)  = 'J(fo)  + n(to) 

to  decide  whether  or  not  a signal  is  present,  and  make 


lo(0))l  > VnMto) 


(7.107) 


(7.107a) 
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by  as  imicli  as  possible.  Applying  the  Schwarz  inequality 


ir 


j; 


l//(w)l^.S’„(tj)dw 


r"js(^ 


iiu}  (7,108) 


to  (7.106)  the  upper  bound  of  the  signal-lo-noise  ratio  can  be  established  as 


< 


2n  I .S„(co) 

«/-eo 


(/co 


(7.109) 


If  we  let  the  transfer  function  //(co)  have  the  form 

//(CO)  = ^ (7.110) 

where  A'  is  an  arbitrary  constant,  p reaches  its  niaxiniuin  in  (7.109).  We  have 


J_ 

27T 


J 5;,(0J) 

»/_O0 


t/cj 


(7.111) 


Thus,  a linear  filter  with  the  transfer  function  (7.110)  is  the  best  filter  among 
linear  filters.  Further,  if  the  noise  /i(t)  is  a normal  random  process  (i.e.,  Gaus- 
sian) the  filter  (7.1 10)  is  an  absolute  optimum.  Physically,  (7.1 10)  is  interpreted 
to  mean  the  larger  the  amplitude  spectrum  of  the  useful  signal  and  the  smaller 
the  power  spectrum  of  the  noise  in  the  frequency  interval  (to,cj+r/co).  the  more 
the  optimum  filter  will  pass  those  frequencies.  Also,  from  (7.1 1 1 ) it  is  apparent 
that  the  greater  the  displacement  of  frequency  spectra  of  the  useful  signal  and 
noise  the  greater  the  signal-to-noise  ratio  at  the  filter  output. 

In  using  filter  (7.1 10)  to  detect  signals  of  known  form  it  is  only  necessary 
to  know  the  value  of  (7.107),  i.e.,  the  value  of  the  output  function  at  time  to- 
According  to  (7.102)  and  (7.1 10)  we  have 


a(f) 


<^-oc 


1.^(CJ)|2 


Jco 


(7.112) 


from  which  it  follows  that 


|a(r)|  < |o(/o)l  = \k\p 


(7.113) 
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By  setting  k = J in  (7,1 10)  we  see  that,  according  to  (7,1 1 1 ) and  (7,1 12), 

a(fo)  = P (7,114) 

and  according  to  (7,105) 

= p (7  114a) 

i.e,,  the  signal-to-noise  ratio  gives  simultaneously  the  useful  signal  and  the  noise 
power  at  the  filter  output.  In  this  case  it  can  be  shown  that  at  the  filter  output 
the  useful  signal  is  related  to  the  correlation  function  of  the  noise  according  to 
tlie  formula 


a(')  = R^(to  - t) 


(7,115) 


7.11  THE  MATCHED  FILTER 

In  the  case  where  the  noise  spectrum  is  uniformly  distributed  over  the  useful 
frequency  band  we  have  the  condition  called  “white  noise,”  For  tliis  condition 

= ,y„  = const.  (7,1  16) 

From  (7,1 10)  the  transfer  function  is 

H{<J)  = (7,117) 

where  arbitrarily  we  have  chosen 


k = ,y„ 

Thus,  for  signals  of  known  waveform  wherein  the  noise  is  treated  as  a constant, 
the  filter  can  be  matched  to  the  waveform  (or  its  conjugate).  Therefore,  it  is 
usually  called  the  matched  filter  (or  conjugate  filter). 

Since  the  signal  s{t)  is  real  the  conjugate  of  its  Fourier  transform  can  be 
written  as 


S*(u)  = S(~oj) 


(7.118) 
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Tlic  signal  ai  the  output  of  tlic  filter  is,  acconJing  to  (7.102),  (7.117)  and 
(7.118), 


a{t) 


± r 

2n 

J — ot 


"'‘*"'//(to).S'(co)(/tj 


which,  upon  changing  the  order  of  integration,  gives 

OO  I*  oo 

= ^ I S(r')r//'l  'U  '''.S’(- 

%/—00  %J—<30 


0(t) 


-CO)(/cJ 


= I s(r')s(r'~  / + ri))i/t' 

•^-oo 

Clearly  (7. 120) has  the  form 


(7.110) 


(7.120) 


i 


Earlier  [c.t.  eq.  (F.17))  we  defined  the  time-average  or  integral  form  of  the 
autocorrelation  function  Wj(?)  as 


£ 


,s(r)s(r-  ?)(/? 


(7.I2I) 


The  integral  form  of /?s(?)  differs  from  definition  (7.10).  Instead  of  taking  the 
ensemble  average  s(/)s(/-J)  for  a large  number  of  identical  experiments,  we 
observe  a single  sample  function  which  contains  all  the  statistics  of  the  process. 


r 
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h'roni  ( 7. 1 2 1 ) wc  have  for  an  ergodic  process 


«j(0)  = I sHDdt  = /•' 


(7.122) 


wliich  is  in  keeping  with  (7.48). 

Substituting  (7.121 ) into  (7.120)  the  signal  at  the  filler  output  is 
oU)  = RsU~  <o) 


(7.123) 


i.e.,  the  matched  filter  is  a correlator',  its  response  to  the  useful  signal  is  its 
correlation  function.  At  / = rp  the  output  becomes 

o(fi,)  = «s(0)  = R (7- >24) 

Khintchine's  theorem  stales  that  for  any  random  process  n(r) 

S„(co)  > 0 

Therefore,  the  following  inetiuality  holds: 


' ^-OO  *■  -<* 


Thus,  it  follows  that 

/?j(0)  > |/?s(t)l  a(/o)  > |o(/)l  (7.125) 

a(ro)  is  the  maximum  value  of  the  useful  signal  at  the  filter  output;  this  max- 
imum value  is  the  total  energy  of  the  signal. 

From  (7.111)  and  (7.1 16)  the  signal-lo-noise  ratio  is 


^ |5(co)|2r/o 


And,  from  (7.1 18)  and  (7.122)  we  have  the  identity 


I T” 
T-  I' 


5(co)|7c/co  = I s^U)dr  = /■: 


• 
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Tluis.  the  signal-lD-noise  ratio  takes  on  the  simple  torm 


P = 


E 


/ 


(7.127) 


i.e..  it  is  determined  by  the  energy  content  of  the  signal  and  the  noise  spectral 
density.  For  situations  where  the  background  noise  is  “constant”  the  only  way 
to  improve  detecting  a signal  of  known  form  is  to  increase  its  total  energy. 

According  to  (7.39)  the  impulse  response  (i(f)  for  the  matched  filter  can 
be  written  as 


h{t)  = 


e^‘^'//(cu)c/w 


= S(t-tQ) 


(7.128) 


Thus,  the  impulse  response  of  the  best  filter  in  white  noise  is  the  mirror  image  of 
the  signal,  delayed  by  /q  seconds.  Therefore,  by  (7.23)  the  output  of  the 
matched  filter  is  of  the  form 


o(r)  = I s(t'~  r + ro)u(t')dt' 


Tlie  matched  filter  forms  the  cross  correlation  between  the  useful  signal  s(t)  and 
the  input  u(t). 

To  appraise  the  operation  of  the  matched  filter  we  assume  the  useful  signal 
has  a sinusoidal  waveform  and  contains  energy  E.  We  specify  the  transfer  func- 
tion as  “rectangular”  having  the  following  characteristics: 


//(w)  = I 


for  -cj  - Acj  < cu 


< -OJ  + iloj 


11(0})  = 0 


for  wp  " Acj  < CJ 


< tj()  -F  Acj 


(7.129) 


r 
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Our  objective  is  to  compare  the  signal-to-noise  ratio  of  the  “mismatched”  rectan- 
gular filter  with  that  of  the  matched  filter  giving 


PO 


= a 


where  Pq  is  the  signal-to-noise  ratio  of  the  matched  filter.  The  exercise  below 
will  be  to  determine  the  value  of  a. 

We  specify  for  the  sinusoidal  pulse  under  consideration  the  appropriate 

form 


s(r)  = Acoscjgt 


= 0 


7b  ^ ^ 7b 

-T  <'<T 


/ -7-0  ^ 7b 

' < > Y 


(7.130) 


where  cog  is  the  carrier  frequency,  A is  the  pulse  amplitude  and  Tq  is  the  pulse 
duration  (or  period).  One  can  write  for  5(co) 


5(co) 


=£ 


f7o 

= A 

■J-Tn 


Tol2 


e cos  cog/  cit 


Ton 


= A 


sin(co  - a)o)7g/2  ^ sin(co -t- cjg)7b/2 


(7.131) 


CO  - cog  to  -I-  cog 

For  the  conditions  where  <jJqT>  1 equation  (7.1 31 ) becomes 

sin(co-coo)rg/2 

5(co)  = A 

CO  - cog 

Frop  (7.102)  the  filter  output  for  the  simple  filtering  specified  by  C^.l  29)  is 

-U)q 


(7.132) 


oU) 


- 


?^‘^'5(co)c/co  + 


Wq  - Au) 


1 

2n 

■ 


e^‘^'.9(co)cfco 


c*)Q  - Aw 
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7T 


i 


W()  +Au; 


S'(0J)CDS  OJt  Jcj 


W() 


By  siibsliUiting  (7.1  .?2)  into  (7.1  33)  we  have 


0(0  = 


A 

n 


COS  CO/ 

cUq  Au> 


sin(co  coo)7((/2 

CO  ~ C0() 


doj 


If  we  impose  the  condition 


.Aco7() 


< n 


f7.l.)3) 


(7. 1.(4) 


(7.1  3.3) 


tlien 


sinfco  - tj())7()/2 

CO  - CO() 

is  positive  over  tlie  integration  interval.  Therefore 

/‘'u>()  + Au) 


|o(/)|  < 


A 

7T 

^ LJi\- 


W{)- Au) 


sin(to-  loo)To/2 

coscoO du>  < 0(0)  (7.136) 

to  - t0() 


where 


ofO) 


= A 


sin(to  ■ coo)ro/2 
to  - top 


t/to 


2y4  / '^sin  X 

" Jn  ~ 


dx  (7.137) 


' u>()- Au> 

or  = (to- co())rn/2;  v = Ato7()/2  (7.138) 

From  (7.104)  and  condition  (7,1 16)  the  noise  intensity  at  the  ftlter  output 


_ , rwo+Auj 


(to)(/to 


2S„ 


-Aui 


(7,13d) 
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i.e..  p is  approximately  one  decibel  less  than  Pq.  Hence,  the  matched  filter  gives 
a larger  signal  to  noise  ratio  because  its  transfer  function  fits  the  signal  spectrum 
from  the  standpoint  of  both  its  bandwidth  and  shape  (amplitude). 


Quantized  Systems-Perturbation 
Theory  and  State  Transitions 


8.1  INTRODUCTION 

As  an  introduction  to  some  of  the  methods  of  statistical  analysis  we  will  exploie 
in  a preliminary  fashion  some  of  the  probabilistic  methods  of  quantum  theory, 
particularly  as  the  theory  describes  the  probability  of  state  transitions  resulting 
from  small  energy  perturbations.  The  theory  has  experienced  a high  degree  of 
success  in  physics  and  chemistry,  and  is  introduced  here  primarily  as  a method 
for  analysis.  In  the  quantum  picture  the  system  is  analyzed  on  the  basis  of  its 
energy  content,  which  is  quantized.  Associated  with  each  of  the  quantized 
energy  levels  are  the  system  states.  The  input  to  the  system  represents  a change 
(perturbation)  in  the  total  energy  of  the  system.  This  change  results  in  the 
system  transitioning  to  various  different  allowable  energy  states.  The  probability 
of  a (state)  transition  occurring  is  directly  related  to  the  perturbing  energy  and 
the  system’s  allowable  quantized  energy  levels. 

There  exists,  however,  a fundamental  difference,  between  the  quantized 
system  and  the  linear  system.  This  difference  arises  from  the  theoretical  con- 
cepts governing  the  respective  theories  and  is  manifested  in  tlie  system  dynamic 


1^*.  'm 
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equations.  It  was  seen  earlier  that  tlie  dynamic  state  equation  for  the  linear 
system  is 


|x>  = Alx>  + B|u> 

which  is  an  outgrowth  of  tlie  idea  of  ordered  input-output  pairs  and  techniques 
to  attach  a (state)  “label”  to  the  pairs.  The  “motion”  of  a quantized  system  is 
governed  by  the  Schrodinger  equation 

/h|x>  = H|x> 

which,  as  will  be  seen  below,  is  one  of  the  fundamental  postulates  of  quantum 
theory. 


8.2  FUNDAMENTAL  POSTULATES  OF  QUANTUM  THEORY 

Tlie  intended  goal  here  is  to  apply  the  methods  of  quantum  theory  to  achieve 
simple  rules  in  describing  system  state  transitions.  In  describing  tliese  methods 
we  first  examine  those  fundamental  postulates’  which  importantly  serve  as  the 
basis  for  the  theory.  Tliey  are  as  follows: 

POSTULATE  A.  To  every  measurable,  real  quantity  F there  is  associated 
an  (Hennitian)  operator  F called  an  observable. 

For  a Hermitian  operator  to  be  an  observable  it  is  required  that  the  eigen- 
vectors associated  with  the  operator  span  the  entire  space.  Thus,  the  eigenvectors 
lx,->  of  the  observable  can  serve  as  a complete  set  of  orthonormal  base  vectors 
where 

<X,|Xy>  = 5,y 

Any  finite  state  vector  |x>  in  this  space  can  be  described  in  terms  of  the  eigen- 
vectors: 


lx>  = Z]c,|x,) 

i 

where  the  Ci  are  the  expansion  coefficients. 


(8.1) 


1 . Sec,  for  example,  Grossman,  L.  M.,  Tfterniodynamics  and  Statistical  Mechanus.McGvdW' 
Hiil.  New  York.  1969.  pp.  77-87. 
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POSTULATE  B.  The  only  possible  values  of  a measurement  of  the  mea- 
surable quantity  F are  the  eigenvalues  X of  the  operator  F. 

From  Postulate  A the  |x/>  are  the  eigenvectors  of  F;  i.e., 

Fix,)  = X,|x,->  (8.2) 

The  scalar  product  formed  by  (8.1)  and  the  /th  eigenvector  identifies  the  (th 
coefficient  c,-  as 

Cj  = <x,lx>  (8.3) 

The  probability  that  the  eigenvalue  X,-  of  F will  be  measured  in  state  |x>  is 
where  c/|x,>  is  the  projection  of  |x>  along  the  normalized  basis  eigenvector  |x,>. 

POSTULATE  C.  If  a system  is  in  a state  represented  by  the  vector  |x>  the 
expected  mean  value  resulting  from  a number  of  measurements  of  the  measura- 
ble quantity  F,  whose  observable  is  F,  is 

_ <xlFlx> 


where  the  double  bracket  <•>  enclosing  a single  quantity  denotes  the  expectation 
or  mean  value. 

If  the  state  vector  |x>  is  normaliz.ed  clearly  (8.4)  becomes 

<F>  = <x|F|x>  (8.5) 

Substituting  (8.1)  into  (8.5)  yields  for  the  expectation  value 


<F)  = H(c]‘cj)Fjj 


where  the  X,-  are  the  eigenvalues  of  F.  Interpreting  the  coefficients  c,-  of  each  X, 
as  a “weighting”  factor  or  the  probability  of  measuring  the  eigenvalue  X,-  we  can 


(F)  = L Pi  X,- 
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t , 


where 


Pi  = c*c-i  = UvP 

In  (8.7)  we  require  the  normali/ation 


lilCll* 


(8.7a) 


for 


a > 0 


(8.7b) 


From  (8.3)  it  is  seen  that  the  probability  /)■  is  the  square  of  the  absolute  value  of 
the  scalar  product  of  the  state  vector  |x>and  the  rth  eigenvector;  i.e., 


Pi  = lc,-l2  = |(x,-|x>|2 


If  the  system  is  in  a state  represented  by  one  of  the  eigenvectors  of  the  observ- 
able F then 


Pi  = |<x,|x/>|2  = 6y- 


(8.8) 


Hence  the  measured  value  of  an  observable  F possesses  a well  defined  probability 
of  one  or  zero  if  the  state  of  the  system  is  an  eigenvector  of  F.  In  this  case  all 
probabilities  are  zero  except  the  one  measuring  tlie  occurrence  of  the  eigenvalue 
X;,  which  is  unity. 

In  considering  two  measurable  quantities,  say  F and  G,  it  can  easily  be 
shown  that  they  can  be  measured  simultaneously  only  if  their  associated  resoec- 
tive  operators  F and  G commute;  i.e.. 


[FG-  GF]  = 0 
IF.GI  = 0 


This  is  to  say  that  the  vector  representing  the  system  state  must  be  an  eigen- 
vector of  both  observables.  The  quantities  corresponding  to  the  two  operators 
can  therefore  be  aribtrarily  well  defined  in  the  same  state.  If  the  operators 
corresponding  to  the  two  physical  quantities  F and  G do  not  commute,  there 
will  be  a dispersion  AFand  A6'  inherent  in  their  measurement.  Tliis  dispersion  is 
given  by  the  inequality  specified  by  the  Heisenberg  uncertainty  principle: 


<(AF)2>((A(7)2>  > 


— ^ . 
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where 

C = /[F,G] 

POSTULATE  D.  The  dynamical  behavior  of  the  system  represented  by 
the  state  vector  |x>/s  determined  by  the  Schrodinger  equation 

/h^|x>=H|x>  (8.9J 

Of 

where  h = hl2it=  i.05  X erg  sec  and  H is  the  operator  (observable)  corre- 
sponding to  the  classical  Hamiltonian  of  the  system. 

The  relationship  between  measurable  quantities  and  classical  concepts  can 
be  readily  established  from  the  time  development  of  the  expectation  value  <f’>. 
In  view  of  (8.9)  we  particularly  want  to  explore  tlie  time  dependency  ofjhiF). 
From  equation  (8.5) 

/h<F)  = /h<x|F|x> 

Using  the  product  rule  for  differentiation  and  the  properties  of  the  complex 
scalar  product  we  have 

/h^<xlFlx>  = <x|F/h-^|x>  + /h(xl|^|x>  - /h^(x|Flx> 
which  becomes  upon  using  the  operator  relationship  of  (8.9) 

;h^<xlF|x>  = <x|F|x>  - <xlHFlx>  + /h<x||j|x> 

= <x|FH-  HF|x>  + /h<xl|^lx>  (8.10) 

ot 


If  the  operator  dF(dt  is  defined  as 


c[F 

dt 


<x||f|x> 


then  equation  (8.10)  yields  the  dynamical  law  in  operator  form: 


dt 


/h 


F.G] 


if 

dt 


(8.11) 


Equation  (8.11)  is  also  referred  to  as  the  Heisenberg  equation  of  motion. 
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8.3  ENERGY  PICTURE 


Clearly  from  (8.10)  and  (8.1 1)  if  the  operators  F and  H commute,  and  F does 
not  change  with  time,  then  the  mean  value  of  the  measurable  quantity  /-'remains 
constant;  i.e..  J(F)IJt  = 0.  It  follows  tliat,  for  this  situation,  the  probability 
of  measuring  eigenvalue  X,  is  also  independent  of  time.  Hence  the  observable  is  a 
conserved  quantity.  Of  particular  interest  among  conserved  quantities  is  the 
associated  energy.  If  the  observable  H corresponding  to  the  classical  Hamiltonian 
of  the  system  is  not  a function  of  time  (dHidt  = 0),  and  since  H commutes  with 
itself,  the  mean  or  average  value  of  the  energy  is  constant;  i.e., 


c/H 

dt 


/■fi 


[H.H]  + ~ 


3/ 


Thus  for  an  isolated  system  where  3H/3/  = 0 equation  (8.1 1)  is  in  concert  with 
the  law  of  conservation  of  energy. 

For  conservative  systems  general  solutions  to  the  Schrodinger  equation 
can  be  obtained  in  a straight  forward  manner.  These  solutions  express  the  state 
vector  in  terms  of  the  (energy)  eigenvectors  of  H.  This  can  be  seen  by  consider- 
ing the  following.  From  Postulate  D the  equation  of  motion  is  concisely 


/h^lx)  = H|x> 
at 


Since  t does  not  appear  explicitly  in  the  differential  equation  we  can  look  for 
solutions  of  the  form 


|x(r)>  = mix) 

i.e.,  a form  consisting  of  the  product  of  a time-dependent  part  and  a nontime- 
dependent  part.  The  corresponding  eigenvalue  problem  is  formulated  as 

H|x/>  = X,lx,-> 

= (8.12) 

where  the  observable  H,  which  is  the  measurable  energy,  is  independent  of  time. 
The  F,-  represent  the  energy  eigenvalues  and  lx,  > the  corresponding  energy  eigen- 
vectors. Since  the  eigenvectors  form  a complete  orthonormal  set  the  vector  |x(r)) 
in  the  state  space  can  be  expanded  as 

|x(r)>  = Z]a,(01xi> 

I 


(8.1.^) 
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where  the  coefficients  a,(r)  are  time  dependent.  Substituting  (8.13)  into  (8.9) 
gives 


S(H  + ^|:)ir,(r)|x,>  = 0 (8.14) 

which,  in  view  of  (8.12),  becomes 

L lx/>  = 0 (8.15) 

1 

! t The  complex  scalar  product  formed  by  (8.15)  and  the  reciprocal  vector  (x^l 

yields  for  the  A:th  coefficient  the  first-order  differential  equation 

' ak(.t)Ek  + = 0 

for  which  the  solution  is 

i'  ai^t)  = (8.16) 

The  coefficient  q is  a constant.  Thus  for  an  isolated  system  where  the  Hamil- 
tonian is  independent  of  time  the  solution  to  the  dynamical  equation  (8.9)  is  of 
the  form 


lx(r)>  = Ec,■e-^'^'"^x,>  (8.17) 

i 

Time  enters  into  the  solution  of  (8.9)  strictly  as  a phase  factor.  The 
coefficients  c,-  are  constants,  the  |x,->  are  eigenvectors  of  the  energy  operator,  and 
the  Ej  arc  the  corresponding  energy  eigenvalues.  From  (8.7)  and  (8.16)  the 
probability  of  measuring  the  Ath  energy  eigenvalue  £> 

= Ic^i^  = const.  (8.18) 

Hence  for  isolated  systems  all  probabilities  are  constant  in  time.  Only  the  rela- 
tive phases  of  the  component  states  change.  If  tlie  system  is  in  a given  energy 
eigenstate  at  time  t it  will  remain  in  that  state  corresponding  to  the  same  energy 
iivalue  for  alt  time.'  Tliis  is  true  for  the  values  of  any  other  observables 

I Op  lit 


f 
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which  commute  with  H.  Thus  states  represented  by  solutions  of  the  form  of 
(8.17)  are  said  to  be  stationary  states.  However,  the  name  is  somewhat  mislead- 
ing. It  is  not  the  state  (8.17)  that  is  independent  of  time,  but  the  probability 
amplitude  (8.18). 

We  also  see  from  (8.17)  that  a stationary  state  has  a well  defined  energy. 
ti  is  a definite  energy  value  in  addition  to  being  the  expectation  value.  A 
determination  of  the  energy  of  a system  in  a stationary  state  yields  a particular 
value  of  £ and  only  that  value.  The  time-energy  uncertainty  relation  of  Postulate 
C,  where 

^E^t  > h 

implies  that  a state  with  a precise  energy  (A£  = 0)  is  possible  only  if  one  has 
unlimited  time  to  measure  same.  This  is  characteristic  of  stationary  states  in  view 
of  the  constancy  of  the  probability  amplitude. 


j 

i 

8.4  TIME-DEPENDENT  PERTURBATIONS;  STATE  TRANSITIONS 

In  the  discussions  thus  far  we  have  examined  mathematical  problems  that  can  be  | 

solved  exactly.  In  practice  problems  that  fit  the  theory  and  can  be  solved  exactly 

are  rare.  One  must  therefore  resort  to  methods  of  approximation.  A powerful 

methc-i)  of  approximation  is  found  in  perturbation  theory.  The  Hamiltonian  is 

fomied  in  two  parts;  one  is  large  and  characterizes  the  system  for  which  the 

Schrodinger  equation  can  be  solved  exactly,  the  other  is  small  and  acts  as  a 

perturbation.  Tliere  are  many  physical  problems  of  this  kind.  For  example, 

systems  which  exert  weak  forces  on  one  another,  or  time-varying  external  forces 

acting  on  a system.  The  system  reactions  can  be  described  in  terms  of  the 

unperturbed  states  of  noninteracting  systems  for  which  exact  solutions  can  be  , 

found.  This  technique  suggests  transitions  among  states.  Thus  perturbation 
theory  provides  a connection  between  the  observables  of  a system  and  its 
stationary  states. 

In  the  previous  section  we  discussed  isolated  systems  for  which  the  Hamil- 
tonian was  time  invariant.  The  system  state  as  a function  of  time  was  shown  to 
be 

|x(r)>  = 

i 

Tlie  amplitude  of  the  £th  state  is  1 

a*(r)  = <x^|x(r)>  = j 
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from  which  it  follows  that 

= Ic^l^  = const. 

If  a time  varying  force  acts  upon  the  system  the  situation  is  changed.  Tlie 
Hamiltonian  now  contains  a time  dependent  as  well  as  a time  invariant  part.  The 
coefficients  a,-  depend  upon  time  in  amplitude  as  well  as  phase.  Consequently,  it 
is  possible  that  certain  states  will  grow  and/or  decay  with  time.  A system  can 
change  its  character  under  the  influence  of  an  external  force;  the  force  produces 
1 ‘ transitions  from  ix,->  to  |x/>.  This  implies  tliat  the  system  energy  changes  from 

I to  the  difference  being  the  work  done  on  the  system  by  the  external  force. 

I Describing  an  external  force  as  an  additional  time-dependent  term  in  the 

I Hamiltonian  is.  at  best,  only  an  approximation.  For  purposes  of  this  discussion 

we  will  consider  only  the  first-order  term  of  perturbation  theory.  Also,  we  will 
assume  that  the  time-dependent  perturbation  is  weak  and  comes  about  as  a 
result  of  the  system  interacting  with  another  system.  Thus  the  Hamiltonian  can 
be  approximated  as 

H = hO  -I-  V(0  (8.19) 

where  is  the  Hamiltonian  of  the  unperturbed  system  and  V(f)  is  the  small 
interaction  or  perturbation  term.  The  superscript  will  be  used  to  identify  quan- 
tities associated  with  the  unperturbed  system.  Further,  we  assume  the  eigen- 
vectors |x,">  of  H**  are  known  and  satisfy  the  relation 

H^^lx,^)  = A^^lxP)  (8.20) 

where  the  corresponding  energy  eigenvalues  are  discrete.  If  the  initial  state  at 
■ time  to  is  expanded  in  terms  of  the  unperturbed  eigenstates  we  have  from  (8.1  7) 

r 

' lx(7o)>  = 

/ 

1 •• 

In  the  absence  of  any  disturbance,  we  have  for  all  time 
|x(/)>  = 


I 

r 


i 
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However,  in  the  presence  ol  V(/)  (8.17)  is  ;io  longer  the  correct  sohition  to  the 
Schrodinger  equation.  To  obtain  the  proper  solution  we  nuist  expand  ;x(/)>  at 
every  instant  of  time  with  the  amplitude  coelTicients  c,  depending  on  time: 

ix(/)>  = (8.21) 

i 

The  probability  amplitude  of  finding  the  system  in  the  ^th  unperturbed  state  is 

q(F)  = (8.22) 

To  readily  determine  the  manner  in  which  state  transitions  occur  we  ex- 
amine the  time  dependency  of  c>.  Substituting  (8.21)  into  the  equation  of 
motion 


/h  — |x>  = H|x>  = (h0+  V)|x> 


gives 


Forming  the  scalar  product  of  the  above  with  (x^Pl  results  in 

+ Z]c7(0<x/’|Vix">e"^''='^^^'’ 
i 

Since  the  lx/*)  form  an  ortlionormal  set  Eq.  (8.20)  gives 
<xO|Hix,o>  = f;,«(x0|x«>  = 
which  reduces  (8.23)  to 

/h^Q(r)  = Ec,(/)<x/’iV|x/V‘''^^'^-*''^’’ 
= Z]c7(/)Kt,e"^'^-'" 


(8.23) 


(8.24) 
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For  small  perturbations  the  C|^(t)  may  remain  small  througliout.  After  the  per- 
turbations have  stopped  the  system  settles  down  to  constant  values  of 
evaluated  at  / = +°°: 

Q(0  = --^J  (8.29) 

From  formula  (8.29)  it  is  seen  that  a system  influenced  by  a time-dependent 
perturbation  makes  transitions  to  other  energy  eigenstates  of  The  quantity 
|q(°°)I^  defines  the  transition  probability  from  state  i to  k.  It  is  proportional  to 
the  square  of  the  absolute  value  of  the  Fourier  component  of  the  perturbation 
matrix  element  F)ti- evaluated  at  the  transition  frequency  co)(,-;i.e., 

/oo  2 

Vkie-^^>“’dt 

30 

where  coyt/  is  deduced  from  the  relation 

COki  = ^ttf 


If  the  system  is  initially  in  the  higher  energy  state  k tire  transition  prob- 
ability to  the  lower  state  r is,  from  (8.29), 


It  can  be  shown  that  V is  a Hermitian  perturbation  operator,  and  that  C0y(.,-  = 
Thus  it  readily  follows  that 

c,(°°)  = (8.30) 

The  two  transition  probabilities  are  equal.  Property  (8.30)  is  the  condition  for 
detailed  balancing.  Tlie  energy  difference  E = heqt/  's  transferred  to  the  radiation 
field  of  the  system. 
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8.5  CONSTANT  PERTURBATION 

As  a matter  of  practical  interest  we  consider  the  case  where  V is  constant  or 
varies  slowly  over  the  period  I /to*,-  Tlie  system  is  in  an  initial  unperturbed 
eigenstate  |x,®)  at  time  fg  =0  and  then  subjected  to  a weak  perturbation  which 
persists  at  some  near  constant  value.  In  keeping  with  our  previous  discussion  the 
time  development  of  a system  with  Hamiltonian  H = + V can  be  conveniently 

described  in  terms  of  transitions  between  eigenstates  of  the  unperturbed  Hamil- 
tonian H^.  The  approximate  equations  (8.27)  apply: 


/hfr-,(r) 


Treating  V as  a constant  and  specifying  the  initial  and  final  discrete  states  as  / 
and  k.  respectively,  integration  of  (8.27)  gives 


<x/*|V|x,®> 


(1  - 


where 


c*(0)  = 0.  c,(0)  =1,  k ¥=  i 

¥ 

The  probability  that  the  system,  in  the  initial  state  i at  time  fg  = 0,  will  be  in  the 
final  unperturbed  eigenstate  k(ki=  i)  at  time  r is 


k*(r)l^  = 


4iK*,-|2 


(t'k 


■ 


Wkit 


(8.32) 


liquation  (8.32)  is  illustrated  graphically  in  Figure  8.1.  It  is  a periodic  function 
of  t with  a period  equal  to  tr/cu^^,-  and  a peak  at  LOki  - 0.  The  expression  is  valid 
only  as  long  as  f,(r)  can  be  approximated  as  c,U)  ^ 1.  during  which  time  the 
transition  probability  to  states  where  F,'^’  ^ k'^  remains  small  for  weak  perturba- 
tions. The  probability  of  finding  the  system  in  state  k is  small  unless  the  energy 
of  the  kth  state  is  close  to  the  energy  of  the  initial  state.  However,  transitions  to 
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Figure  8.1 


states  where  have  an  iniporlant  properly.  Wlien  /.j'  ci|uation 

(8.32)  can  be  approximated  as 

Thus  the  transition  probability  to  the  ^th  unperturbed  eigenstate  increases 
quadratically  with  time.  This  has  special  importance  when  the  slates  in  the 
neigliborhood  of  the  initial  energy  are  very  closely  spaced  and  constitute  a near 
continuum. 

It  is  not  physically  possible  to  measure  for  a single  value  of  A'.  The 
classical  measurement  is  of  the  tota/  probability  that  the  system  made  a transi- 
tion from  an  initial  to  a final  state.  We  define  the  total  transition  probability  to 
all  possible  final  states  as 


Transition  probability  = 


k 


where  tiie  summation  extends  over  all  final  states  under  consideration.  For  a 
quasi-continuum  of  energy  stales  per  unit  energy  level  we  introduce  the  density 
of  final  unperturbed  states  denoted  by  p(A').  Tlie  quantity  p{E)dE  measures  the 
number  of  final  states  in  the  interval  dE  containing  energy  E.  The  total  transi- 
tion probability  into  these  states  is  determined  by  multiplying  (8.32)  by 
p(E^)dE^  and  integrating  with  respect  to  dEj^  : 


I - cosoj^/r 
(A'i’  - E^)^ 


p(E^)dE° 


(8.34) 


I Fit, -I  = KxitIVIx,'’)! 


where 
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The  time  rate  ot' change  of'  the  total  transition  probability,  w,  is 

'V  = (8.35) 

In  analyzing  (8.35)  K*,  and  p are  reasonably  constant  over  a small  energy 
range  near  tf.  However,  sin  Ukitlu^ki  oscillates  rapidly  in  this  same  energy 
interval  for  all  t satisfying  the  relation 

t > h/6/t;^  (8.36) 

and  has  a pronounced  peak  at  -tf-  Clearly  those  transitions  which  tend  to 
conserve  the  unperturbed  energy  are  dominant.  Also  is  usually  comparable 
to  Thus  h/5/f®  is  a very  short  time.  Hence  there  is  a large  range  of  t where 
(8.36)  is  fulfilled  yet  the  initial  state  / is  not  appreciably  depleted.  During  this 
time  (8.35)  can  be  approximated  as 

vv=^r  lc*(0l'p(i'^^/C«  = ^lK,,|p(£'0)|  (8-37) 

-oo  — » 

Under  the  conditions  stated  the  transition  probability  per  unit  time  becomes 

w = ^IF*,1P(£’^  (8.38) 

Equation  (8.38)  shows  a constant  rate  of  transition.  This  result  comes  about 
because  we  summed  over  transitions  which  conserve  the  unperturbed  energy 
(t'k  =£',.®)  and  transitions  that  violate  this  conservation.  From  (8.33)  it  is  seen 
that  the  transition  probabilities  of  the  former  type  increase  quadratically  with 
time,  whereas  from  (8.32)  it  is  seen  that  transitions  of  the  latter  type  are 
periodic.  The  result  is  a compromise  between  these  two  and  the  transition  rate  is 
constant.  Result  (8.38)  has  been  termed  by  Fermi  as  the  golden  nile  of  time- 
dependent  perturbation  theory. 

8.6  EXPONENTIAL  DECAY 

Our  discussions  on  perturbation  theory  was  centered  around  the  approximate 
solutions  to  equations  (8.24): 

ih^CkU)  = LciUWkiel'^'^d 
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From  these  equations  it  was  seen  tiiat  if  ^ 0 transitions  from  an  initial  state 
to  various  available  final  states  occur.  The  probability  that  the  system  will  make 
a transition  in  time  interval  between  t and  t + cJt  is  equal  to  wdt.  By  the  condi- 
tion for  detailed  balancing,  (8.30),  these  final  states  contribute  to  the  probabil- 
ity amplitudes  of  the  initial  states  througli  a feedback  process.  As  the  amplitudes 
of  the  final  states  grow,  they  do  so  at  the  expense  of  the  initial  states,  since 
probability  is  conserved.  Because  of  the  different  frequencies  co*/  of  the  feed- 
back process  the  contributions  made  by  the  amplitudes  to  amplitudes  C/ are 
all  of  different  phases.  Thus  if  there  are  many  available  states  k,  fonning  a 
near-continuum,  the  contributions  made  by  these  states  tend  to  cancel.  This 
destructive  interference  in  the  probability  amplitude  is  interpreted  as  a gradual 
(exponential)  depletion  of  the  initial  state. 

It  is  inferred  above  that  the  probability  of  finding  tlie  system  at  time  / still 
in  the  initial  state  is  proportional  to  exp(-wt).  This  is  the  exponential  decay  law. 
To  derive  the  exponential  decay  law  we  no  longer  consider  C/(t)  on  the  right- 
hand  side  in  (8.24)  as  a constant.  However,  in  our  approximation  we  will  con- 
tinue to  neglect  all  other  contributions  to  the  change  in  ci^it).  Equation  (8.24) 
becomes 


/■h^fArCO  = K*,cv(/')e^‘^'^'''  k*i 

Ck(t)  = Ci(t')e'^ki'd,’  (8.39) 

•^0 

where  c;t(0)  ~ 0 for  fc  =?^  /.  The  prime  identifies  t with  Ck-  However,  by  reciprocal 
manipulation  of  (8.24)  it  can  be  shown  that  the  equation  of  motion  for  c,(r)  is 
rigorously 


= li  Vi,iCkit)e>'^‘k'  + ViiCiit)  (8.40) 

k*i 

Substituting  (8.39)  into  (8.40)  gives  ^ 

^c,(0  = ~ _ IviiCiU)  (8.41) 


Assuming  that  the  pertinent  final  states  are  in  a near-continuum  with  a density 


I 

[ 


r 
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of  states  the  sum  can  be  replaced  by  an  integral  extending  over  all 

possible  transition  frequencies.  Equation  (8.41)  becomes 


/oo  ^ ( 


(8.42) 


Equation  (8.42)  is  a differential  equation  of  the  Volterra  type.  It  can  be  approxi- 
mated by  the  simple  equation* 


_ _4£'P 
2 li  ' 


c/(0 


(8.43) 


where  5/f®  is  the  shift  of  the  unperturbed  energy  level  A’**  due  io  second-order 
perturbation.^  With  c,(0)  = 1 equation  (8.43)  yields 

c,(f)  = exp  y ■ 

which  describes  the  exponential  decay. 


We  next  examine  the  probability  that  the  system  has  decayed  into  state  it. 
Substituting  (8.44)  into  (8.39)  and  integrating: 


j r' 

Cki<)  = exp 


1 - exp  ^ - h — / j exp 


.Yf,n5£}0-£-0-/hf  r' 
^(£;P+5£-0-fO)r 


dt' 


El^  - (E9+6E,°)  + ihf 


The  probability  \ck\^  that  the  system  has  decayed  into  state  k becomes 

/r  \ ! E^  + 5£'.®  - E?\ 

1 - 2exp(^f)cos(-* jt  + exp(-rr) 

\ck'?  = Wki\^ (8.45) 

(£-0-£-P-5£;.V  + -T 


1.  See  V.  F.  Weisskopf  and  E.  P.  Wigner.  Z.  Physik.  63,  54  (1930). 

2.  The  reader  is  reminded  that  our  discussion  on  perturbation  addressed  only  first-order 
terms.  An  exception  is  made  in  the  derivation  of  equation  (8.43). 


J 
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where  l'  = hw.  For  t long  compared  to  the  state  lifetime  1/F  the  transition 
probability  is  approximated  as 


Equation  (8,46)  is  a bell-shaped  curve.  It  has  a pronounced  peak  at  those  final 
state  energy  levels  equal  to  tf  -F  . The  width  of  the  curve  is  equal  to 
hu’. 
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Appendix  A 

DIRAC  DELTA  FUNCTIONS 


The  delta  function  5(g  - a)  is  a mathematically  improper  function  having  the 
following  pioperties  (in  one  dimension): 


6(1 -a)  = 0 

for  ? ^ a 

(A.l) 

*oo 

5(?-a)r/?  = 1 

if  region  of  integration 

(A.2) 

• oe 

includes  ? = a 

f 5(?-a)  = 0 

if  region  of  integration 

(A.3) 

does  not  include  | = a 
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Higlier  order  delta  functions  are  defined  as  the  derivative  of  6(^  - a); 

c/"6(|-a) 


6"«-a)  = 


dt" 


(A.4) 


Tlie  delta  function  of  another  function  can  be  transformed  according  to  the  rule 


H/m  = 


fA.5) 


where  f(Xo)  = Q.  For  an  aribtrary  function  /(?)  the  sifting  property  of  delta 
functions  provides  for  the  integral  equations 


J = fi“) 


(A.6) 


(A.7) 


It  is  difficult  to  physically  imagine  the  delta  function  or  the  unit  impulse 
as  it  is  also  referred-to.  Qualitatively,  it  can  be  thought  of  as  a small  pulse  of  high 
magnitude  and  infinitesimally  small  duration  (see  Fig.  A-1).  We  require  that  as 
the  peak  of  the  curve  gets  higher,  the  width  gets  narrower  in  such  a way  that  the 
area  under  the  curve  remains  constant  (unity).  Thus  the  unit  impulse  5(r)  can  be 
regarded  the  limit  as  0 of  the  pulse  p(/)  having  width  A^  and  height  1/AJ. 


r 

R 


Figure  A-1  Unit  Impulse. 
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Appendix  B 

RESOLUTION  OF  CONTINUOUS-TIME  SIGNALS 
INTO  UNIT  IMPULSES 


An  arbitrary  signal  u(t)  can  be  approximated  in  any  finite  time  interval 
7 by  a finite  number  of  unit  pulses  of  width  AJ  occurring  at  times 
t = where  A:  = ±1,±2,...,±A  = 7/AJ.  Fig.  B-l  illustrates  the  idea.  Since  the 
heiglrt  of  the  unit  pulse  is  1/A|,  the  pulse  at  / = is  multiplied  by  u(kA^)A^ 
thereby  resulting  in  the  amplitude  u{t).  The  approximation  of  u{t)  can  be 
written  as 


N 

u(t)  = H u(kA%)A^p{t  - kAk)  (B.l) 

k=-N 


= \[m  u(kA%)A^p{t  - kA^i)  (B.2) 


where  p{t)  is  defined  in  Appendix  A.  In  the  limit  as  A|  -»■  0 and  the  pulses 

become  impulses  and  the  summation  becomes  an  integral.  Thus  the  approxima- 
tion becomes  exact.  We  have 


u(t)  = 


-7 


(B.3) 


over  the  finite  time  interval  (-T.T).  Extending  the  integral  over  the  entire  time 
domain  defining  «(?),  i e.,  letting 


u(t)  = 


(B.4) 


Thus,  any  continuous-time  signal  u(t)  can  be  resolved  into  a continuum  of  unit 
pulses.  It  follows  that  the  response  of  a linear  system  to  excitation  u(t)  can  be 
readily  found  if  the  response  to  the  unit  impulse  is  known.  Hence  the  unit 
impulse  response  completely  characterizes  the  system. 
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Appendix  C 

DISCRETE-TIME  STATE  EQUATIONS 


From  (2.1)  the  standard  form  of  the  nth  order  difference  equation  can  be 
written  as 


OoAk)  + a^yik  - 1 ) + . . . + a„\\k  - n) 

= b^u(k  ~ 1 ) + . . . + bj«(A'  - s) 

(C.l) 

{aQ+a^A~' + ...  + a„A~")y(k)  = {b^A  ^ + ...+h^A  ^)u{k) 
where  A is  the  linear  advance  operator  defined  for  any  interger  i>  as 

(C.2) 

A''f(k) 

= f(k  + 1^) 

(C.3) 

Introducing  the  variable  v(k)  where 

(flo  + a,/l  ' + ... 

a„A~”)v{k)  = u(k) 

(C.4) 

and 

. +b,A-^)v(k)  = y(k) 

(C..S) 

APPENDJXES 

equation  (C.2)  gives 

!;(*)  = — - a„_^v(k  - n + 1)  - .. 

“O 

Specifying  the  elements  of  x(k)  as 

x^(k)  = v(k-n) 

X2(k)  = v(k  - /I  + 1 ) 


x„(k)  = v(k-\) 


we  can  write 


+ 1)  = xjik) 

X2(k  + 1)  = x^ik) 

x„^^(k+\)  = x„{k) 

x„(k  + 1)  = — [w(^)  " a^Xy(k)  - ...  - 

Aq 

In  matrix  form  (C.8)  becomes 

x(it  +1)  = Ax(ft)  + Bu(A:) 

where 


r 

0 1 

0 

0 

k 

r 

0 0 

1 

0 

f 

A = 

a» 

“n-\ 

a\ 

1 

ao 

"0 

F-rom  (C.5)  we  have 

,.(*)  = h^v{k-  1)  + b2v(k-2)  + ...  + h^v(k-s) 
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Substituting  (C. 7)  into  (CM  1)  gives 

y(k)  = b^x„{k)  + b2X„_i(k)  + ...  + V„  + ,_,(Ar)  (C.12) 

or,  in  matrix  form. 


y{k)  = Cx(^)  + Du(7c)  (C.13) 

where 

C = i.v„  . b,  I D = 0 (C.14) 

Matrices  (C'.IO)  and  (( '.  14)  are  not  unique.  Tliey  are  one  of  a variety  of  ways  to 
represent  (C'.9)and  (CM3). 


Appendix  D 
TRANSFORMS 


Listed  below  are  transfor^is  of  some  of  the  more  commonly  used  mathemati- 
cal functions: 


S[/(^)l 

= F(z)  = H f{k)z  * 

k > 0 

A:=0 

Sla*] 

1 

t -1 

iz\  > Ifll 

\ ~ az  ^ 

II 

1 

e 

Izl  > 1 

J[cos  <jjk\ 

_ 1 - z~*costo 

1 - 22~*COSCO  + z~^ 

Ul  > 1 

J 
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SIW)1 

Z[k] 

S[*^l 

sr'i 


S (/(*  + !)] 


(1  - z->)2 
-log(l  - z"') 

gUz 

z[F(z)-/(0)l 


|z|  > 1 
|z|  > 1 

k>  I |zl>l 

Izl  >0 
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ways.  Listed  below  are  a few  elementary  transform  terms  useful  in  expansion  by 
partial  fractions: 

Time  sequence 


1 

z - a 

1 . F(z)  converges  |z|  > a 

2.  F(z)  converges  |z|  < a 

z 

(z  - fl)2 

z{z  + a) 
(z  - a)3 

i 

z(z^  +4az  +a^) 
(z  - af 

r 

Appendix  E 

ANALOGOUS  QUANTITIES  OF  CONTINUOUS-TIME 
AND  DISCRETE-TIME  SYSTEMS 

QUANTITY 

CONT.-TIME  SYSTEM 

DISCRETE  TIME  SYSTEM 

■ 

Fundamental 

matrix 

^r)  = e*' 

= A* 

Transform  of 
fundamental  matrix 

1 

II 

> 

4>(z)  = (1  - Az"’)“’ 

Impulse  response 
matrix 

H(t)  = C$(f)B  + D6(f) 

H(ifc)  = C4>(A:-1)B  (k>l) 
H(k)  = D (it  = 0) 

Transfer  function 
matrix 

H(s)  = C4>(s)B  + D 

H(z)  = Cz~'<I>(z)B  + D 

I 

I 
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Appendix  F 

STOCHASTIC  PROCESSES 


Assume  that  a given  experiment  is  performed  on  a system  in  which  outcomes  f 
are  measured  (see  Figure  F.I).  Let  the  outcomes  f form  the  space  S.  Associated 
with  S are  subsets  called  events  and  the  probabilities  of  these  events.  To  every 
outcome  f we  can  assign  the  time  function  The  set  of  functions 
one  for  each  f,  is  called  a stochastic  process.  For  a specific  outcome  f/  the 
expression  is  a single  time  function.  For  a specific  time  f/,  {«({■,//)} 

depends  on  f and  is  a random  variable.  is  a number.  Where  it  is 

understood  that  f is  the  random  variable  of  the  random  process  we  shall 

use  the  notation  u(t)  to  represent  the  stochastic  process: 

= a(0  (F.i) 

The  notation  u is  used  to  distinguish  a sample  function  of  a random  process 
from  the  function  u established  in  Chapter  1 . 

By  repeating  the  experiment  n times  we  obtain  n sample  functions  as 
shown  in  Figure  F.I.  For  a specific  instant  of  time  /,•  we  observe  the  values  f < a 
and  denote  this  total  number  of  trials  as  rif(a).  Accordingly,  we  establish  the 
distribution  function  F(a;r)  as: 


F(a,t) 


Wf(a) 

n 


= <a} 


(F.2) 


'l  '2 

Figura  F.I  Sample  functions  of  a random  process. 
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F{a,t)  is  the  probabiJity  of  the  event  consisting  of  all  outcomes  f 

such  that  at  time  /,•  the  functions  ufj)  do  not  exceed  the  value  f = a.  Equation 
(F.2)  expresses  the  first  order  distribution  of  the  process  u(t).  The  density  of 
F(a\t)  is,  simply, 


dF(a\t) 


= /(«;0 


Similarly,  given  two  instants  of  time  t\  and  t2  where  the  corresponding 
values  of  f are  aj  and  ai,  respectively,  we  form  the  joint  distribution  function 
F{ay,a2\t\,t2)  as  follows: 

F(ai,a2;tt,f2)  = F{«(ri)<ai,M(r2)<a2}  (F.4) 

Expression  (F.4)  is  the  second  order  distribution  of  the  process  u(f).  The  corre- 
sponding density  is 

bF{ax,a2\ti,t2) 

(F.5) 

Although  it  is  not  entirely  correct  we  will  state  that  for  “practical”  pur- 
poses a real  stochastic  process  can  be  statistically  determined  by  its  nth  order 
distribution  function 

F(ai,a2,...a„;ri,f2,.  -.fn)  = F{M(ri)  < aj , «(r2)  < a2,... M(f„)  < a„} 

The  nth  order  density 


/(ai,a2,...,a„;ri,f2 t„) 


is  determined  by  differentiating  with  respect  to  all  variables  a,-  (/  = l,...,n). 

The  mean  value  of  the  process  u(f)  is  the  expected  value  F{u(t)}  of  the 
random  variable : 

F{u(0}  = F{«(f,0} 


af(aj)da 


where  f(a,t)  is  the  probability  density  function  associated  with  the  value  f = a. 
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and  f{a,t)da  is  the  probability  of  finding  the  value  a within  a ± da.  Therefore,  it 
must  follow  that: 


£ 


f{a,t)da  = 1 


(F.7) 


The  auto  correlation  R(t\,t2)  of  a process  «(/)  is  the  joint  moment  of 
{w(f.  7j)}  and  t2)},  i e.,  of  u(ti)  and  «*(72)' 


= I a\a2f(a\,a2\t\,t2)da\da2  (F-8) 

•'—go 

where  • denotes  the  conjugate.  For  real  processes  u(t)  = u*(t). 

A stochastic  process  is  said  to  be  stationary  in  the  strict  sense  if  its  statis- 
tics are  independent  of  a shift  in  the  time  origin,  i.e.,u(t)  and  u(t  + 6)  have  the 
same  statistics  for  all  5 . Similarly,  u(t)  is  said  to  be  stationary  in  the  wide  sense  if 
its  expected  value  or  mean  is  a constant  and  its  autocorrelation  function  depends 
only  on  t2~  t\  '. 


£'{«('■)}  = nit)  ~ const.  (F.9) 

R(f,,r2)  = R{t2-t\)  = ««)  = E{u{t-H)u{t)} 

= u{t-l)u{t)  (F.IO) 

where  it  is  understood  that  t establishes  a fixed  time  and  t varies  over  the  time 
interval  of  interest. 

The  process  u(t)  is  said  to  have  uncorrelated,  independent  or  orthogonal 
increments  if  and  M(t,+i)  is  a sequence  of  uncorrelated,  independent  or 
orthogonal  intervals. 

Given  two  real  or  complex  processes  u(t)  and  z^f)  the  cross-correlation  of 
the  two  processes  is  defined  as 

Euv(‘ld2)  = £'{«(fl)»*02)} 


M(fl)i'*(f2) 


(F.II) 
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The  two  processes  are  said  to  be  orthogonal  if 

= 0 (F.12) 

Similarly,  the  two  processes  are  uncorrelated  if  their  cross-covariance  Cu„  is 
zero,  i.e.,  where 

(^uvUldl)  = l^uv{tld2)  - £'{M(fl)}^;'{£*(f2)} 

= Iiuv(t\d2)  - u(t\)^t2) 

= 0 (F.13) 

It  follows  that  for  two  processes  to  be  uncorrelated 

f^uvU]d2)  = u(t])vHt2)  (F.14) 

Lastly,  the  two  processes  are  independent  if  |w(/i) «(fn)l  is  independent  of 

Ii!(fi), for  any  f] t„,t\ /*. 

A real  stationary  stochastic  prrKess  u(t)  is  said  to  be  ergodic  if  all  of  its 
statistics  can  be  determined  from  a single  ::ample  function  u(ij.t).  Since  many 
statistical  parameters  can  be  expressed  as  time  averages  it  can  be  said  that  u(t)  is 
ergodic  if  the  time  averages  are  equal  to  the  ensemble  averages,  i.e.,  if 

<M(frO>  = <m(0>  = F;{M(f,/)}  = /•.{M(r)}  (F.I5) 

where  <m(5/./)>  is  the  time  average  of  the  sample  function  We  Imply  in 

(F.IS)  that  the  statistics  for  u(r)  are  the  same  for  all  sample  functions  ulf,./!. 
Therefore, 


<u(f.r)>  = <m(M> 

Below  we  examine  some  of  the  conditions  under  which  (F.  1 5)  holds  true 
Given  a stationary  stochastic  process  u(t)  the  limits 


<M(r)>  = lim  f u(t)dt  (F  I ill 

r,.2rj~ 

1 r' 

K(^)  = ^im  u(t  - i)u(t)dt 


(F.I7) 
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where  T is  the  interval  define  the  mean  and  autocorrelation  as  time 

averages.  Form  the  ergodicity  theorem  every  function  u(f,/,)in  (F.16)  defines  a 
number.  Thus,  <m(0)  is  a random  variable.  The  random  variable  <«(f)>is  equal  to 
the  constant  £'{m(0}  = «W  only  'f  Us  variance  is  zero.  To  determine  this 
variance  we  form  the  finite  average 


which  has  the  mean 


■1-r 

£’«m(0>}  = ^ J E{u(t)}<it 


On  forming  the  second  moment  h can  be  shown  that  the  variance 

of  the  time  average  (uft))  is 


o2  = yj  (l- 


= - Im(OI^  (F-21) 

If  (F.20)  tends  to  zero  as  r->°°  then  u(l)  is  ergodic  with  respect  to  its  mean. 
Thus, 

<u(t)>  = = W) 

Using  (F.17)  it  can  be  shown,  in  a manner  similar  to  that  above,  that  the 
process  uft  - ^)u(t)  is  ergodic  with  respect  to  its  autocorrelation  function 


R(^)  = lim 
T’-o 


k)u{t)dt 


= £'{u(r-|)n(0}  = 


I,  , H A ^ 


190 


AN  INTRODUCTION  TO  THE  THEORY  OF  LINEAR  SYSTEMS 


if  the  limit 


lim 

T’ —*■00 


C(-^) 


[R{r)-R%)\d!i  = 0 


(F.24) 


where 


R(t)  = E{u(t + T)u(t  + T)u(t  + 


(F.25) 


The  power  spectrum  or  spectral  density  of  a process  u(t)  is  the  Fourier 
transform  5(w)  of  its  autocorrelation  function  /?(|); 

5(w)  = [/?«)] 

= J R{^)e->^^d^  (F.26) 

From  the  inverse  transform  /?(?)  can  be  expressed  in  terms  of  S(u)  as 

^oo 

m = ^ j S(u:)ef‘^^daj  (F.27) 


Setting  ^ = 0 in  (F.27)  gives 

/oe 

S(o:)dio  = £'{|n(0l^}  > 0 (F.28) 

oe 


The  power  spectrum  S(co),  as  a time  average  (ergodicity)  of  the  process 
M(r),  is  defined  as  the  limit 


S(u)  = lim  — 

/■-►OO  it 


1 


u(t)e~l'^^dli\ 


of  the  random  power 


■ 2T 


"'-t 


u{t)e~’'^^d!i\ 


(F.29) 


(F.30) 


For  (F.29)  to  hold  true  we  demand  that  S(to)  tend  to  S(w)  and  its  variance  tend 
to  zero  as  r 
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